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Preface 


As a student, I was rearing at the bit, after a course on quantum mechanics, to 
learn quantum field theory, but the books on the subject all seemed so formidable. 
Fortunately, I came across a little book by Mandl on field theory, which gave 
me a taste of the subject enabling me to go on and tackle the more substantive 
texts. I have since learned that other physicists of my generation had similar good 
experiences with Mandl. 

In the last three decades or so, quantum field theory has veritably exploded and 
Mandl would be hopelessly out of date to recommend to a student now. Thus 
I thought of writing a book on the essentials of modern quantum field theory 
addressed to the bright and eager student who has just completed a course on 
quantum mechanics and who is impatient to start tackling quantum field theory. 

I envisaged a relatively thin book, thin at least in comparison with the many 
weighty tomes on the subject. I envisaged the style to be breezy and colloquial, 
and the choice of topics to be idiosyncratic, certainly not encyclopedic. I envisaged 
having many short chapters, keeping each chapter “bite-sized.” 

The challenge in writing this book is to keep it thin and accessible while at the 
same time introducing as many modern topics as possible. A tough balancing act! 
In the end, I had to be unrepentantly idiosyncratic in what I chose to cover. Note to 
the prospective book reviewer: You can always criticize the book for leaving out 
your favorite topics. I do not apologize in any way, shape, or form. My motto in 
this regard (and in life as well), taken from the Ricky Nelson song “Garden Party,” 
is “You can’t please everyone so you gotta please yourself.” 

This book differs from other quantum field theory books that have come out in 
recent years in several respects. 

I want to get across the important point that the usefulness of quantum field 
theory is far from limited to high energy physics, a misleading impression my 
generation of theoretical physicists were inculcated with and which amazingly 
enough some recent textbooks on quantum field theory (all written by high energy 
physicists) continue to foster. For instance, the study of driven surface growth 
provides a particularly clear, transparent, and physical example of the importance 
of the renormalization group in quantum field theory. Instead of being entangled 
in all sorts of conceptual irrelevancies such as divergences, we have the obviously 
physical notion of changing the ruler used to measure the fluctuating surface. 
Other examples include random matrix theory and Chern-Simons gauge theory 
in quantum Hall fluids. I hope that condensed matter theory students will find this 
book helpful in getting a first taste of quantum field theory. The book is divided 
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into eight parts,! with two devoted more or less exclusively to condensed matter 
physics. 

I try to give the reader at least a brief glimpse into contemporary developments, 
for example, just enough of a taste of string theory to whet the appetite. This book is 
perhaps also exceptional in incorporating gravity from the beginning. Some topics 
are treated quite differently than in traditional texts. I introduce the Faddeev-Popov 
method to quantize electromagnetism and the language of differential forms in 
developing Yang-Mills theory, for example. 

The emphasis is resoundingly on the conceptual rather than the computational. 
The only calculation I carry out in all its gory details is that of the magnetic 
moment of the electron. Fortunately, many excellent texts focus on calculational 
techniques. Throughout, specific examples rather than heavy abstract formalism 
will be favored. Instead of dealing with the most general case, I always opt for the 
simplest. 

I had to struggle constantly between clarity and wordiness. In trying to antic- 
ipate and to minimize what would confuse the reader, I often find that I have to 
belabor certain points more than what I would like. 

I tried to avoid the dreaded phrase “It can be shown that...” as much as 
possible. Otherwise, I could have written a much thinner book than this! There are 
indeed thinner books on quantum field theory: I looked at a couple and discovered 
that they hardly explain anything. I must confess that I have an almost insatiable 
desire to explain. 

As the manuscript grew, the list of topics that I reluctantly had to drop also kept 
growing. So many beautiful results, but so little space! It almost makes me ill to 
think about all the stuff (bosonization, instanton, conformal field theory, etc., etc.) 
Thad to leave out. As one colleague remarked, the nutshell is turning into a coconut 
shell! 

Shelley Glashow once described the genesis of physical theories: “Tapestries 
are made by many artisans working together. The contributions of separate workers 
cannot be discerned in the completed work, and the loose and false threads have 
been covered over.” I regret that other than giving a few tidbits here and there I 
could not go into the fascinating history of quantum field theory, with all its defeats 
and triumphs. On those occasions when I refer to original papers I suffer from that 
disconcerting quirk of human psychology of tending to favor my own more than 
decorum might have allowed. I certainly did not attempt a true bibliography. 

The genesis of this book goes back to the quantum field theory course I taught as 
a beginning assistant professor at Princeton University. I had the enormous good 
fortune of having Ed Witten as my teaching assistant and grader. Ed produced 
lucidly written solutions to the homework problems I assigned, to the extent that 
the next year I went to the chairman to ask “What is wrong with the TA I have 
this year? He is not half as good as the guy last year!” Some colleagues asked me 


"Murray Gell-Mann used to talk about the eightfold way to wisdom and salvation in 
Buddhism (M. Gell-Mann and Y. Ne’eman, The Eightfold Way). Readers familiar with 
contemporary Chinese literature would know that the celestial dragon has eight parts. 
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to write up my notes for a much needed text (those were the exciting times when 
gauge theories, asymptotic freedom, and scores of topics not to be found in any 
texts all had to be learned somehow) but a wiser senior colleague convinced me that 
it might spell disaster for my research career. Decades later, the time has come. I 
particularly thank Murph Goldberger for urging me to turn what expository talents 
I have from writing popular books to writing textbooks. It is also a pleasure to say 
a word in memory of the late Sam Treiman, teacher, colleague, and collaborator, 
who as a member of the editorial board of Princeton University Press persuaded me 
to commit to this project. I regret that my slow pace in finishing the book deprived 
him of seeing the finished product. 

Over the years I have refined my knowledge of quantum field theory in discus- 
sions with numerous colleagues and collaborators. As a student, I attended courses 
on quantum field theory offered by Arthur Wightman, Julian Schwinger, and Sid- 
ney Coleman. I was fortunate that these three eminent physicists each has his own 
distinctive style and approach. 

The book has been tested “in the field” in courses I taught. I used it in my field 
theory course at the University of California at Santa Barbara, and I am grateful 
to some of the students, in particular Ted Erler, Andrew Frey, Sean Roy, and Dean 
Townsley, for comments. I benefitted from the comments of various distinguished 
physicists who read all or parts of the manuscript, including Steve Barr, Doug 
Eardley, Matt Fisher, Murph Goldberger, Victor Gurarie, Steve Hsu, Bei-lok Hu, 
Clifford Johnson, Mehran Kardar, Ian Low, Joe Polchinski, Arkady Vainshtein, 
Frank Wilczek, Ed Witten, and especially Joshua Feinberg. Joshua also did many 
of the exercises. 

Talking about exercises: You didn’t get this far in physics without realizing 
the absolute importance of doing exercises in learning a subject. It is especially 
important that you do most of the exercises in this book, because to compensate for 
its relative slimness I have to develop in the exercises a number of important points 
some of which I need for later chapters. Solutions to some selected problems are 
given. 

I will maintain a web page http://theory.kitp.ucsb.edu/~zee/nuts.html listing 
all the errors, typographical and otherwise, and points of confusion that will 
undoubtedly come to my attention. 

I thank my editors, Trevor Lipscombe, Joe Wisnovsky, and Sarah Green and 
the staff of Princeton Editorial Associates (particularly Cyd Westmoreland and 
Evelyn Grossberg) for their advice and for seeing this project through. Finally, I 
thank Peter Zee for suggesting the cover painting and Catherine Zee for playing 
Mozart sonatas while I proofread. 


Convention, Notation, and Units 


For the same reason that we no longer use a certain king’s feet to measure distance, 
we use natural units in which the speed of light c and the Dirac symbol fi are 
both set equal to 1. Planck made the profound observation that in natural units 
all physical quantities can be expressed in terms of the Planck mass Mpjanck = 
1/./GNewton  10!°Gev. The quantities c and fi are not so much fundamental 
constants as conversion factors. In this light, Iam genuinely puzzled by condensed 
matter physicists carrying around Boltzmann’s constant k, which is no different 
from the conversion factor between feet and meters. 

Spacetime coordinates x“ are labeled by Greek indices (u = 0, 1, 2, 3) with 
the time coordinate x° sometimes denoted by t. Space coordinates x! are labeled 
by Latin indices (i = 1,2, 3 ) and 0,, = 0/dx" . We use a Minkowski metric 
n¥” with signature (+, —,—, — ) so that n° —+1. We write nd,99,9 = 
0,pd4p = (dp)* = (8p/dt)* — >; (@p/dx')*. The metric in curved spacetime is 
always denoted by g“”, but often I will also use g“” for the Minkowski metric 
when the context indicates clearly that we are in flat spacetime. 

Since I will be talking mostly about relativistic quantum field theory in this 
book I will without further clarification use a relativistic language. Thus, when I 
speak of momentum, unless otherwise specified, | mean energy and momentum. 
Also since i = 1, I will not distinguish between wave vector k and momentum, 
and between frequency and energy. 

In local field theory I deal primarily with the Lagrangian density £ and not 
the Lagrangian L = f d3x &£ . As is common practice in the literature and in oral 
discussion, I will often abuse terminology and simply refer to £ as the Lagrangian. 
I will commit other minor abuses such as writing 1 instead of J for the unit matrix. 
We use the same symbol g for the Fourier transform y(k) of a function g(x) 
whenever there is no risk of confusion, as is almost always the case. I prefer an 
abused terminology to cluttered notation and unbearable pedantry. 

The symbol « denotes complex conjugation, and ¢ hermitean conjugation: The 
former applies to a number and the latter to an operator. I also use the notation 
c.c. and h.c. Often when there is no risk of confusion I abuse the notation, using + 
when | should use «. For instance, in a path integral, bosonic fields are just number- 
valued fields, but nevertheless I write g' rather than y*. For a matrix M, then of 
course M? and M* should be carefully distinguished from each other. 

I made an effort to get factors of 2 and right, but some errors will be inevitable. 
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QUANTUM FIELD THEORY IN A NUTSHELL 


PART I 
MOTIVATION AND FOUNDATION 


Chapter I.1 
Who Needs It? 


Who needs quantum field theory? 


Quantum field theory arose out of our need to describe the ephemeral nature of 
life. 

No, seriously, quantum field theory is needed when we confront simultaneously 
the two great physics innovations of the last century of the previous millennium: 
special relativity and quantum mechanics. Consider a fast moving rocket ship close 
to light speed. You need special relativity but not quantum mechanics to study its 
motion. On the other hand, to study a slow moving electron scattering on a proton, 
you must invoke quantum mechanics, but you don’t have to know a thing about 
special relativity. 

It is in the peculiar confluence of special relativity and quantum mechanics that 
a new set of phenomena arises: Particles can be born and particles can die. It is 
this matter of birth, life, and death that requires the development of a new subject 
in physics, that of quantum field theory. 

Let me give a heuristic discussion. In quantum mechanics the uncertainty 
principle tells us that the energy can fluctuate wildly over a small interval of time. 
According to special relativity, energy can be converted into mass and vice versa. 
With quantum mechanics and special relativity, the wildly fluctuating energy can 
metamorphose into mass, that is, into new particles not previously present. 

Write down the Schrédinger equation for an electron scattering off a proton. 
The equation describes the wave function of one electron, and no matter how you 
shake and bake the mathematics of the partial differential equation, the electron 
you follow will remain one electron. But special relativity tells us that energy 
can be converted to matter: If the electron is energetic enough, an electron and a 
positron (“the antielectron”) can be produced. The Schriédinger equation is simply 
incapable of describing such a phenomenon. Nonrelativistic quantum mechanics 
must break down. 

You saw the need for quantum field theory at another point in your education. 
Toward the end of a good course on nonrelativistic quantum mechanics the inter- 
action between radiation and atoms is often discussed. You would recall that the 
electromagnetic field is treated as a field; well, it is a field. Its Fourier components 
are quantized as a collection of harmonic oscillators, leading to creation and an- 
nihilation operators for photons. So there, the electromagnetic field is a quantum 
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Figure 1.1.1 


field. Meanwhile, the electron is treated as a poor cousin, with a wave function 
W(x) governed by the good old Schrédinger equation. Photons can be created or 
annihilated, but not electrons. Quite aside from the experimental fact that electrons 
and positrons could be created in pairs, it would be intellectually more satisfying 
to treat electrons and photons, as they are both elementary particles, on the same 
footing. 

So, Iwas more or less right: Quantum field theory is a response to the ephemeral 
nature of life. 

All of this is rather vague, and one of the purposes of this book is to make 
these remarks more precise. For the moment, to make these thoughts somewhat 
more concrete, let us ask where in classical physics we might have encountered 
something vaguely resembling the birth and death of particles. Think of a mattress, 
which we idealize as a 2-dimensional lattice of point masses connected to each 
other by springs (Fig. I.1.1) For simplicity, let us focus on the vertical displacement 
[which we denote by q,(¢)] of the point masses and neglect the small horizontal 
movement. The index a simply tells us which mass we are talking about. The 
Lagrangian is then 


L= 5(>) mq? ~~ > kb9a4b ~~ > Sabc4@ad bc — °° ‘) (1) 
a a,b 


a,byc 


Keeping only the terms quadratic in g (the “harmonic approximation”) we have 
the equations of motion mg, = — }°, ka,q,- Taking the q’s as oscillating with 
frequency w, we have >~,, kanqy = mw"q,. The eigenfrequencies and eigenmodes 
are determined, respectively, by the eigenvalues and eigenvectors of the matrix 
k. As usual, we can form wave packets by superposing eigenmodes. When we’ 
quantize the theory, these wave packets behave like particles, in the same way that 
electromagnetic wave packets when quantized behave like particles called photons. 
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Since the theory is linear, two wave packets pass right through each other. But 
once we include the nonlinear terms, namely the terms cubic, quartic, and so forth 
in the q’s in (1), the theory becomes anharmonic. Eigenmodes now couple to each 
other. A wave packet might decay into two wave packets. When two wave packets 
come near each other, they scatter and perhaps produce more wave packets. This 
naturally suggests that the physics of particles can be described in these terms. 

Quantum field theory grew out of essentially these sorts of physical ideas. 

It struck me as limiting that even after some 75 years, the whole subject of 
quantum field theory remains rooted in this harmonic paradigm, to use a dreadfully 
pretentious word. We have not been able to get away from the basic notions of 
oscillations and wave packets. Indeed, string theory, the heir to quantum field 
theory, is still firmly founded on this harmonic paradigm. Surely, a brilliant young 
physicist, perhaps a reader of this book, will take us beyond. 


Condensed matter physics 


In this book I will focus mainly on relativistic field theory, but let me mention 
here that one of the great advances in theoretical physics in the last 30 years 
or so is the increasingly sophisticated use of quantum field theory in condensed 
matter physics. At first sight this seems rather surprising. After all, a piece of 
“condensed matter’ consists of an enormous swarm of electrons moving nonrel- 
ativistically, knocking about among various atomic ions and interacting via the 
electromagnetic force. Why can’t we simply write down a gigantic wave function 
W(x}, X2, +++, Xy), where x,; denotes the position of the jth electron and N is 
a large but finite number? Okay, WV is a function of many variables but it is still 
governed by a nonrelativistic Schrédinger equation. 

The answer is yes, we can, and indeed that was how solid state physics was first 
studied in its heroic early days, (and still is in many of its subbranches.) 

Why then does a condensed matter theorist need quantum field theory? Again, 
let us first go for a heuristic discussion, giving an overall impression rather than 
all the details. In a typical solid, the ions vibrate around their equilibrium lattice 
positions. This vibrational dynamics is best described by so-called phonons, which 
correspond mote or less to the wave packets in the mattress model described above. 

This much you can read about in any standard text on solid state physics. 
Furthermore, if you have had a coutse on solid state physics, you would recall that 
the energy levels available to electrons form bands. When an electron is kicked 
(by a phonon field say) from a filled band to an empty band, a hole is left behind 
in the previously filled band. This hole can move about with its own identity as a 
particle, enjoying a perfectly comfortable existence until another electron comes 
into the band and annihilates it. Indeed, it was with a picture of this kind that Dirac 
first conceived of a hole in the “electron sea” as the antiparticle of the electron, the 
positron. 

We will flesh out this heuristic discussion in subsequent chapters. 
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Marriages 


To summarize, quantum field theory was born of the necessity of dealing with the 
marriage of special relativity and quantum mechanics, just as the new science of 
string theory is being born of the necessity of dealing with the marriage of general 
relativity and quantum mechanics. 


Chapter I.2 


Path Integral Formulation 
of Quantum Physics 


The professor’s nightmare: a wise guy in the class 


As I noted in the preface, I know perfectly well that you are eager to dive into 
quantum field theory, but first we have to review the path integral formalism 
of quantum mechanics. This formalism is not universally taught in introductory 
courses on quantum mechanics, but even if you have been exposed to it, this chapter 
will serve as a useful review. The reason I start with the path integral formalism 
is that it offers a particularly convenient way of going from quantum mechanics 
to quantum field theory. I will first give a heuristic discussion, to be followed by a 
more formal mathematical treatment. 

Perhaps the best way to introduce the path integral formalism is by telling a 
story, certainly apocryphal as many physics stories are. Long ago, in a quantum 
mechanics class, the professor droned on and on about the double-slit experiment, 
giving the standard treatment. A particle emitted from a source S (Fig. 1.2.1) at time 
t = 0 passes through one or the other of two holes, A; and A), drilled in a screen 
and is detected at time tf = T by a detector located at O. The amplitude for detection 
is given by a fundamental postulate of quantum mechanics, the superposition 
principle, as the sum of the amplitude for the particle to propagate from the source 
S through the hole A, and then onward to the point O and the amplitude for the 
particle to propagate from the source S through the hole Ay and then onward to 
the point O. 

Suddenly, a very bright student, let us call him Feynman, asked, “Professor, 
what if we drill a third hole in the screen?” The professor replied, “Clearly, the 
amplitude for the particle to be detected at the point O is now given by the sum 
of three amplitudes, the amplitude for the particle to propagate from the source § 
through the hole A, and then onward to the point O, the amplitude for the particle 
to propagate from the source S through the hole A, and then onward to the point 
O, and the amplitude for the particle to propagate from the source S through the 
hole A3 and then onward to the point O.” 

The professor was just about ready to continue when Feynman interjected again, 
“What if I drill a fourth and a fifth hole in the screen?” Now the professor is visibly 
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losing his patience: “All right, wise guy, I think it is obvious to the whole class that 
we just sum over all the holes.” 

To make what the professor said precise, denote the amplitude for the particle 
to propagate from the source S through the hole A; and then onward to the point 
O as A(S — A; — QO). Then the amplitude for the particle to be detected at the 
point O is 


A(detected at O)= )° A(S > A; > O) (1) 


i 


But Feynman persisted, “What if we now add another screen (Fig. 1.2.2) with 
some holes drilled in it?” The professor was really losing his patience: “Look, can’t 
you see that you just take the amplitude to go from the source S to the hole A; in 
the first screen, then to the hole B j in the second screen, then to the detector at O, 
and then sum over all i and 7?” 

Feynman continued to pester, “What if I put in a third screen, a fourth screen, 
eh? What if I put in a screen and drill an infinite number of holes in it so that the 
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screen is no longer there?” The professor sighed, “Let’s move on; there is a lot of 
material to cover in this course.” 

But dear reader, surely you see what that wise guy Feynman was driving at. 
I especially enjoy his observation that if you put in a screen and drill an infinite 
number of holes in it, then that screen is not really there. Very Zen! What Feynman 
showed is that even if there were just empty space between the source and the 
detector, the amplitude for the particle to propagate from the source to the detector 
is the sum of the amplitudes for the particle to go through each one of the holes 
in each one of the (nonexistent) screens. In other words, we have to sum over the 
amplitude for the particle to propagate from the source to the detector following 
all possible paths between the source and the detector (Fig. [-2.3). 


A (particle to go from S$ to O in time T) = 


> A (particle to go from S to O in time T following a particular path) (2) 
(paths) 


Now the mathematically rigorous will surely get anxious over how > (paths) is 
to be defined. Feynman followed Newton and Leibniz: Take a path (Fig. 1.2.4), 
approximate it by straight line segments, and let the segments go to zero. You can 
see that this is just like filling up a space with screens spaced infinitesimally close 
to each other, with an infinite number of holes drilled in each screen. 
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Fine, but how to construct the amplitude A (particle to go from S to O in time T 
following a particular path)? Well, we can use the unitarity of quantum mechanics: 
If we know the amplitude for each infinitesimal segment, then we just multiply 
them together to get the amplitude of the whole path. 

In quantum mechanics, the amplitude to propagate from a point g; to a point g 
in time T is governed by the unitary operator e~'47 where H is the Hamiltonian. 
More precisely, denoting by |g) the state in which the particle is at g, the amplitude 
in question is just (gp|e '#7 |g;). Here we are using the Dirac bra and ket 
notation. Of course, philosophically, you can argue that to say the amplitude is 
(qrle ‘#7 |q;) amounts to a postulate and a definition of H. It is then up to 
experimentalists to discover that H is hermitean, has the form of the classical 
Hamiltonian, et cetera. 

Indeed, the whole path integral formalism could be written down mathemat- 
ically starting with the quantity (q¢-|e~'”7 |q,), without any of Feynman’s jive 
about screens with an infinite number of holes. Many physicists would prefer a 
mathematical treatment without the talk. As a matter of fact, the path integral for- 
malism was invented by Dirac precisely in this way, long before Feynman. 

A necessary word about notation even though it interrupts the narrative flow: We 
denote the coordinates transverse to the axis connecting the source to the detector 
by q, rather than x, for a reason which will emerge in a later chapter. For notational 
simplicity, we will think of g as 1-dimensional and suppress the coordinate along 
the axis connecting the source to the detector. 


Dirac’s formulation 


Let us divide the time T into N segments each lasting 6t = T/N. Then we write 


-iHT 1 HSt ,—t Ht .. e i Ms 


(qrle lar) = (grle lq7) 


Now use the fact that |g) forms a complete set of states so that f dq |q){q| =1. 
Insert 1 between all these factors of e~?”*' and write 


—iHT 


(qrle la7) 


N-1 
=({] J aajarie™ lan—1)(an—11e 8 ayo) + -- 
j=l 


- qgle 4 ai) (qyle 2 a7) " (3) 


Focus on an individual factor (q;,,|e7'””*' |q;). Let us take the baby step 
of first evaluating it just for the free-particle case in which H = p*/2m. The 
hat on p reminds us that it is an operator. Denote by |p) the eigenstate of p, 
namely p |p) = p |p). Do you remember from your course in quantum mechanics 
that (g|p) =e’??? Sure you do. This just says that the momentum eigenstate is 
a plane wave in the coordinate representation. (The normalization is such that 
f (dp/27)|p)(p| = 1.) So again inserting a complete set of states, we write 
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_j8e¢p2 d _i8e¢p2 
(qjyile (a Pm) lq;) -/ 5, isle fOr 2") |p) (plq;) 


AD _sse¢p? 
= il —e PP a. 1p) (plq;) 
2n 


_ | AP ,—idt(p?/2m) piplaj+1—4)) 
2m 

Note that we removed the hat from the momentum operator in the exponential: 
Since the momentum operator is acting on an eigenstate, it can be replaced by its 
eigenvalue. 

The integral over p is known as a Gaussian integral, with which you may already 
be familiar. If not, turn to Appendix 1 to this chapter. 

Doing the integral over p, we get 


1 
~idt(p2/2m) —i2am \? img; 4;—9;)21/26t 
( . e idt(p* } . ( ) e iti 4j 
qjoil lqj) Bt 


i 
_ (2) 2 ei dt (m/2LGj41-9))/8P 
t 


Putting this into (3) yields 


r 


—i 1 of 2 pv) * —go- bt} 
(qple'"" Iq7) =( faae' (m /2) Diy [@j41-4j)/8t] 


with gg = q; and gy =qr. 

We can now go to the continuum limit 5¢ —- 0. Newton and Leibniz taught us 
to replace [(q;+1 — q;)/6tF by 47, and 6t Yo by fo dt. Finally, we define the 
integral over paths as 


—i2am 7 NA 
Da(t)= li ——_—_ dq.. 
il q(t) Jim. ( bt ) 1 4 


We thus obtain the path integral representation 
. fT tna 
(gnle" Ign) = f Dgteye' bo m4 (4) 


This fundamental result tells us that to obtain (q,|e~'”7 |g;) we simply inte- 
grate over all possible paths g(t) such that q(0) = g; and q(T) = qr. 

As an exercise you should convince yourself that had we started with the 
Hamiltonian for a particle in a potential H = p*/2m + V(g) (again the hat on 
q indicates an operator) the final result would have been 


i . pt lie 
(qrle ‘77 |qz) -| Daq(t) ei do atldmg’—Viq)l 6) 
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We recognize the quantity lmg* — V(q) as just the Lagrangian L(g, q). The 
Lagrangian has emerged naturally from the Hamiltonian. In general, we have 


. pe . 
apie!” \q;) = J Do(t) ei fe Hae 6) 


To avoid potential confusion, let me be clear that t appears as an integration variable 
in the exponential on the right-hand side. The appearance of t in the path integral 
measure Dq(t) is simply to remind us that g is a function of ¢ (as if we need 
reminding). Indeed, this measure will often be abbreviated to Dg. You might recall 
that fo dtL(q, q) is called the action S(q) in classical mechanics. The action S is 
a functional of the function g(t). 

Often, instead of specifying that the particle starts at an initial position g, and 
ends at a final position gr, we prefer to specify that the particle starts in some 
initial state 7 and ends in some final state F . Then we are interested in calculating 
(F|e #7 |7), which upon inserting complete sets of states can be written as 


f dae f dar(Flarviarle™ lar) (arlZ), 


which mixing Schrédinger and Dirac notation we can write as 


[dar | darvear ari lar)'¥7 (47). 


In most cases we are interested in taking |7) and |F') as the ground state, which 
we will denote by |0). It is conventional to give the amplitude (0|e~'””7 |0) the 
name Z. 

At the level of mathematical rigor we are working with, we count on the 


Tr : 
path integral f Dq(t) e So delgmq°—V(q)h to converge because the oscillatory phase 
factors from different paths tend to cancel out. It is somewhat more rigorous to 
perform a so-called Wick rotation to Euclidean time. This amounts to substituting 
t —> —it and rotating the integration contour in the complex ¢ plane so that the 
integral becomes 


r -2 
Z= f Da(ty eho aan ve, ) 


knownas the Euclidean path integral. As is done in Appendix | to this chapter with 
ordinary integrals we will always assume that we can make this type of substitution 
with impunity. 

One particularly nice feature of the path integral formalism is that the classical 
limit of quantum mechanics can be recovered easily. We simply restore Planck’s 
constant ft in (6): 


. T . 
(gpl e7@/M AT lar) -| Da(t) ail) fo ath (q,q) 
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and take the A — 0 limit. Applying the stationary phase or steepest descent method 


rr 
(if you don’t know it see Appendix 2 to this chapter) we obtain eel?) So dE de+4c) 
where g,(t) is the “classical path” determined by solving the Euler-Lagrange 
equation (d/dt)(6L/8q) — (6L/8q) = 0 with appropriate boundary conditions. 


Appendix 1 
I will now show you how to do the integral G = pre dxe~3* , The trick is to square the 
integral, call the dummy integration variable in one of the integrals y, and then pass to polar 
coordinates: 


> +00 1.2 +oo 1.2 +00 4,2 
G -| dx e 2* il dye 2 =a f dr re 2" 
- 0 


Le a) —-OO 
+00 
= 20 [ dwe “=2n 
0 
Thus, we obtain 


tee 1,2 
| dxe 2” =V/2n (8) 
—O 

Believe it or not, a significant fraction of the theoretical physics literature consists 
of performing variations and elaborations of this basic Gaussian integral. The simplest 
extension is almost immediate: 


oe Lox2 2n 3 
J dx e@ 2% = (=) (9) 


as can be seen by scaling x > x/,/a. 
Acting on this repeatedly with —2(d/da) we obtain 
+00 —tax? 2 
dx e@ 2°" x 
2%) = Foo de Px | an —1)(2n —3)-+-5-3-1 (10) 


$00 yy gp gan? a" 


The factor 1/a” follows from dimensional analysis. To remember the factor (2n — I! = 
(2n — 1)(2n —3)---5-3-1 imagine 2n points and connect them in pairs. The first point 
can be connected to one of (2n — 1) points, the second point can now be connected to one of 
the remaining (27 — 3) points, and so on. This clever observation, due to Gian Carlo Wick, 
is known as Wick’s theorem in the field theory literature. Incidentally, field theorists use the 
following graphical mnemonic in calculating, for example, (x®) : Write (x®) as (xxxxxXx) 
and connect the x’s, for example 


(xxx xxx) 


The pattern of connection is known as a Wick contraction. In this simple example, since 
the six x’s are identical, any one of the distinct Wick contractions gives the same value 
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a~> and the final result for (x®) is just a~> times the number of distinct Wick contractions, 
namely 5 -3-1= 15. We will soon come to a less trivial example, in which we have distinct 
x’s, in which case distinct Wick contraction gives distinct values. 

An important variant is the integral 


1 
[~ dx eB +4* — () © el [a (11) 


00 a 


To see this, take the expression in the exponent and “complete the square”: —ax7/2 + Jx = 
—(a/2)(x* — 24 x/a) = —(a/2)(x — J/a)* + J*/2a. The x integral can now be done by 
shifting x + x + J/a, giving the factor of (2n/a)?. Check that we can also obtain (10) by 
differentiating with respect to J repeatedly and then setting J = 0. 

Another important variant is obtained by replacing J by iJ: 


L 
| a er ee (=) * J? /2a (12) 


00 a 


To get yet another variant, replace a by —ia: 


ww 
| TO gx HOH (=)  gnit? [2a (13) 


Let us promote a to a real symmetric N by N matrix A;; and x to a vector x; (i, j = 
1,---, N). Then (11) generalizes to 


/ il a | dxjdxy---dxy eA tT = (= ) eT ATS C4) 
—0o ¥—900 =09 det[ A] 


where x-A-x =x,;A,;x; and J - x = J,x, (with repeated indices summed.) To see this, 
diagonalize A by an orthogonal transformation 0: A= O~!- D - O where D isa diagonal 
matrix. Call y; = O;,;x;. In other words, we rotate the coordinates in the N dimensional 


Euclidean space over which we are integrating. Using 


+00 +00 +00 +00 
[Oo [Pan dige ff ay ay 
—90 OO —0o —o0o 


we factorize the left-hand side of (14) into a product of N integrals of the form in (11). The 
result can then be expressed in terms of D~!, which we write as O - A~'- O—!. (To make 
sure you got it, try this explicitly for N = 2.) 

Putting in some i’s (A — —iA, J + iJ), we find the generalization of (13) 


+00 p+t+oo +00 
| / wee il dx,dxy Las dxn eli/2)e Axti dx 
ma? * —OoO --00O 


1 
- (Sora) e EPA (15) 
Cc 


The generalization of (10) is also easy to obtain. We differentiate (14) with respect to J 
repeatedly and then setting J —> 0. We find 
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(aja jo xx) = DOA a0 (A Yea (16) 
Wick 
where we have defined 


(xix jo + EX) 


+oo p+oo +00 —1y.4. 
-o Joo oe fe dx dx. ---dxye 3* *xjX jo  XEX 7) 
~ ~1y.A. 
[oR PS [TRS daydxy ++ dxy e72*'4* 
and where the set of indices {a, b, --- , c, d} represent a permutation of the set of indices 
{i, j, °°, &, 2}. The sum in (16) is over all such permutations or Wick contractions. It is 


easiest to explain (16) for a simple example (x;x ;x;,x;). We connect the x’s in pairs (Wick 
contraction) and write a factor (A~'),,, if we connect x, to x,. Thus, 


(xix j%ex)) = (A) (A Dar F(A) CAD jp HAD AD jr (18) 


(Recall that A and thus A~! are symmetric.) Note that since (x;x j) =(474);;, the right- 
hand side of (16) can also be written in terms of objects such as (x;x j). Please work out 
(X;X jX_X]X_X,); you will become an expert on Wick contractions. Of course, (16) reduces 
to (10) for N = 1. 

Perhaps you are like me and do not like to memorize anything, but some of these formulas 
might be worth memorizing as they appear again and again in theoretical physics (and in 
this book). 


Appendix 2 


To do an exponential integral of the form J = [7° dge~/S@ we often have to resort 
to the steepest-descent approximation, which I will now review for your convenience. 
In the limit of # small, the integral is dominated by the minimum of f(g). Expanding 
£@ = f@+3f"(@@q —a)* + OL — a)*] and applying (9) we obtain 


4 1 
[=e W/)f@ (= ) e OF?) (19) 
f"(@) 

For f(q) a function of many variables g,,..., qj and with a minimum at qj =4;, we 

generalize immediately to 

1 
T=e Wife) (<,) ° o Oth?) (20) 
det f”(a) 


Here f(a) denotes the N by N matrix with entries [f"(@)];; = (3° f/84;94j)\q=a . Inmany 
situations, we do not even need the factor involving the determinant in (20). If you can derive 
(20) you are well on your way to becoming a quantum field theorist! 


Exercises 


1.2.1. Verify (5). 
1.2.2. Derive (16). 


Chapter 1.3 
From Mattress to Field 


The mattress in the continuum limit 


The path integral representation 
. re Lin? 
Z = (Oje'”7 (0) =| Dat) ei dy Hhgmd-V@l (1) 


which we derived for the quantum mechanics of a single particle, can be general- 
ized almost immediately to the case of N particles with the Hamiltonian 


1 ~ ~ an” ~ 
H=)° —— p+ V@i. 42+ +s Gy): (2) 
—~ 2m, 


We simply keep track mentally of the position of the particles g, witha = 1,2,---, 
N. Going through the same steps as before, we obtain 


Z= (je 1770) = | Dqit) (3) 


with the action 
r 2 
sa) =f dt( >> 4m_g2 — Vigi.42.°- +» gui). 


The potential energy V(q;, qo, ---, gy) now includes interaction energy between 
particles, namely terms of the form v(q, — q,), a8 Well as the energy due to an 
external potential, namely terms of the form w(q,). In particular, let us now write 
the path integral description of the quantum dynamics of the mattress described in 
Chapter I.1, with the potential 


1 
V@i da. 4n) =) Skabdade + 
ab 


We are now just a short hop and skip away from a quantum field theory! Suppose 
we are only interested in phenomena on length scales much greater than the lattice 
spacing / (see Fig. I.1.1). Mathematically, we take the continuum limit / + 0. In 


16 
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this limit, we can replace the label a on the particles by a two-dimensional position 
vector x, and so we write g(t, x) instead of g,(t). It is traditional to replace the 
Latin letter g by the Greek letter y. The function g(t, x) is called a field. 

The kinetic energy }°, 4m “fe now becomes f d*x10(4g/dt)*. We replace 
y., by 1/27 f d*x and denote the mass per unit area m,/ P? by o. We take all 
the m,’s to be equal; otherwise o would be a function of x, the system would be 
inhomogeneous, and we would have a hard time writing down a Lorentz-invariant 
action (see later). 

We next focus on the first term in V = )°,, 4hgnGa9p + ~~. Write 2g,qp = 
(da — 4)" — 92 — g7 . Assume for simplicity that k,,, connect only nearest neigh- 
bors on the lattice. For nearest-neighbor pairs (¢, — gp)” X P(ap/ax)* +--+ in 
the continuum limit; the derivative is obviously taken in the direction that joins the 
lattice sites a and b. 

Putting it together then, we have 


T 
S(q) > S@) = [ dt / d*xL(g) 
0 


r 1 dg 2 dg 2 dg , 
_ 2,/ op op og 
=f) J e<s|e (5) ~»| (52) +(%) 
rg’ —cyt+--| (4) 


where the parameters p and rt are determined by k,, and /. The precise relations 
do not concern us. 

Henceforth in this book, we will take the T -» oo limit so that we can integrate 
over all of spacetime in (4). 

We can clean up a bit by writing p = oc? and scaling y -> y/./c, so that 
the combination (89/81)? — c2[(8p/Ax)? + (89/8y)*] appears in the Lagrangian. 
The parameter c evidently has the dimension of a velocity and defines the phase 
velocity of the waves on our mattress. It is interesting that Lorentz invariance, with 
c playing the role of the speed of light, emerges naturally. 

We started with a mattress for pedagogical reasons. Of course nobody believes 
that the fields observed in Nature, such as the meson field or the photon field, are 
actually constructed of point masses tied together with springs. The modern view, 
which I will call Landau-Ginzburg, is that we start with the desired symmetry, say 
Lorentz invariance if we want to do particle physics, decide on the fields we want 
by specifying how they transform under the symmetry (in this case we decided on 
a scalar field y), and then write down the action involving no more than two time 
derivatives (because we don’t know how to quantize actions with more than two 
time derivatives). 

We end up with a Lorentz-invariant action (setting c = 1) 


l 1 g A 
S= dé -<@ 2 2 2,2... 8,3_ ™,,4 ven 5 
| | 5 @0 5m ee ae (5) 
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where various numerical factors are put in for later convenience. The relativistic 
notation (0)? = 0,,90"y = (89/dt)* — (8~/dx)* — (89/dy)* was explained in 
the note on convention. The dimension of spacetime, d, clearly can be any integer, 
even though in our mattress model it was actually 3. We often write d= D+ 1 
and speak of a (D + 1)-dimensional spacetime. 

We see here the power of imposing a symmetry. Lorentz invariance together 
with the insistence that the Lagrangian involve only at most two powers of 0/dt 
immediately tells us that the Lagrangian can only have the form! £L =1(dg)? — 
V(g) with V a polynomial in gy. We will have a great deal more to say about 
symmetry later. Here we note that, for example, we could insist that physics 
is symmetric under gy — —g, in which case V(g) would have to be an even 
polynomial. 

Now that you know what a quantum field theory is, you realize why I used the 
letter q to label the position of the particle in the previous chapter and not the more 
common x. In quantum field theory, x is a label, not a dynamical variable. The x 
appearing in g(t, x) corresponds to the label a in q,(f) in quantum mechanics. 
The dynamical variable in field theory is not position, but the field gy. The variable 
x simply specifies which field variable we are talking about. I belabor this point 
because upon first exposure to quantum field theory some students, used to thinking 
of x as a dynamical operator in quantum mechanics, are confused by its role here. 

In summary, we have the table 


7m 6 
Galt) > ot, 2) = (x) (©) 


Thus We finally have the path integral defining a scalar field theory ind = (D + 1) 
dimensional spacetime: 


; dy 71 
z=| Doe! f 42(400-Vioy (7) 


Note that a (0 + 1)-dimensional quantum field theory is just quantum 
mechanics. 


1 Strictly speaking, a term of the form U (g)(4q)’ is also possible. In quantum mechanics, 
a term such as U(q)(dq/dt)? in the Lagrangian would describe a particle whose mass 
depends on position. We will not consider such “nasty” terms until much later. 
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The classical limit 


As Ihave already remarked, the path integral formalism is particularly convenient 
for taking the classical limit. Remembering that Planck’s constant / has the dimen- 
sion of energy multiplied by time, we see that it appears in the unitary evolution 
operator e(—'/)#T. Tracing through the derivation of the path integral, we see that 
we have to simply divide the overall factor i by f to get 


ZK | Dgell® f axho) (8) 


In the limit with # much smaller than the relevant action we are considering, 
we can evaluate the path integral using the stationary phase (or steepest descent) 
approximation, as I explained in the previous chapter in the context of quantum 
mechanics. We simply determine the extremum of f d*x£(g). According to the 
usual Euler-Lagrange variational procedure, this leads to the equation 


~~ =9 (9) 


We thus recover the classical field equation, exactly as we should, which in our 
scalar field theory reads 


P+ m p(x) + Eo + =x)" $0 (10) 


The vacuum 


In the point particle quantum mechanics discussed in Chapter I.2 we wrote the path 
integral for (F |e!" {7), with some initial and final state, which we can choose 
at our pleasure. A convenient and particularly natural choice would be to take 
|Z) = |F) to be the ground state. In quantum field theory what should we choose 
for the initial and final states? A standard choice for the initial and final states 
is the ground state or the vacuum state of the system, denoted by |0), in which, 
speaking colloquially, nothing is happening. In other words, we would calculate 
the quantum transition amplitude from the vacuum to the vacuum, which would 
enable us to determine the energy of the ground state. But this is not a particularly 
interesting quantity, because in quantum field theory we would like to measure 
all energies relative to the vacuum and so, by convention, would set the energy 
of the vacuum to zero (possibly by having to subtract an infinite constant from 
the Lagrangian). Incidentally, the vacuum in quantum field theory is a stormy sea 
of quantum fluctuations, but for this initial pass at quantum field theory, we will 
not examine it in any detail. We will certainly come back to the vacuum in later 
chapters. 
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Ed 


Figure 1.3.1 


Disturbing the vacuum 


We would like to do something more exciting than watching a boiling sea of 
quantum fluctuations. We would like to disturb the vacuum. Somewhere in space, 
at some instant in time, we would like to create a particle, watch it propagate for 
a while, and then annihilate it somewhere else in space, at some later instant in 
time. In other words, we want to create a source and a sink (sometimes referred to 
collectively as sources) at which particles can be created and annihilated. 

To see how to do this, let us go back to the mattress. Bounce up and down 
on it to Create some excitations. Obviously, pushing on the mass labeled by a 
in the mattress corresponds to adding a term such as J,(t)g, to the potential 
V(q1. %,°*-»4n)- More generally, we can add 5°, J,(t)qq. When we go to field 
theory this added term gets promoted to J(x)g(x) in the field theory Lagrangian, 
according to the promotion table (6). 

This so-called source function J(t, x) describes how the mattress is being 
disturbed. We can choose whatever function we like, corresponding to our freedom 
to push on the mattress wherever and whenever we like. In particular, J(x) can 
vanish everywhere in spacetime except in some localized regions. 

By bouncing up and down on the mattress We can get wave packets going off 
here and there (Fig. 1.3.1). This corresponds precisely to sources (and sinks) for 
particles. Thus, we really want the path integral 


Z =| Doe f 4°x13.@9"-V@+J OI] (11) 
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Free field theory 


The functional integral in (11) is impossible to do except when 


L(y) = 41)? — mo") (12) 


The corresponding theory is called the free or Gaussian theory. The equation of 
motion (9) works out to be (37 + m)y = 0, known as the Klein-Gordon equation.’ 


Being linear, it can be solved immediately to give p(X, t) = e'( —2) with 
wo =k? +m? (13) 


In the natural units we are using, f& = 1 and so frequency @ is the same as energy 
Aw and wave vector k is the same as momentum fk. Thus, we recognize (13) as 
the energy-momentum relation for a particle of mass m, namely the sophisticate’s 
version of the layperson’s E = mc?. We expect this field theory to describe a 
relativistic particle of mass m. 

Let us now evaluate (11) in this special case: 


z= / Doel f Bx qlee Jo} (14) 


Integrating by parts under the { d*x and not worrying about the possible contri- 
bution of boundary terms at infinity (we implicitly assume that the fields we are 
integrating over fall off sufficiently rapidly), we write 


T= / Doe! f U'x1- 790? +m 9+ Jol (15) 


You will encounter functional integrals like this again and again in your study 
of field theory. The trick is to imagine discretizing spacetime. You don’t actu- 
ally have to do it: Just imagine doing it. Let me sketch how this goes. Replace 
the function g(x) by the vector y; = y(ia) with i an integer and a the lattice 
spacing. (For simplicity, Iam writing things as if we were in 1-dimensional space- 
time. More generally, just let the index i enumerate the lattice points in some 
way.) Then differential operators become matrices. For example, dg(ia) — (1/a) 
(9141 — 9) = 0; Mij@;, with some appropriate matrix M. Integrals become 
sums. For example, f d*xJ(x)o(x) > a* ), Jig;. 

Now, lo and behold, the integral (15) is just the integral we did in (1.2.15) 


2 The Klein-Gordon equation was actually discovered by Schrédinger before he found 
the equation that now bears his name. Later, in 1926, it was written down independently by 
Klein, Gordon, Fock, Kudar, de Donder, and Van Dungen. 


22 Ll. Motivation and Foundation 


+00 p+oo +00 
/ J wee J dgq)dq, --- dqy ef/ 4 A444 4 
—0°O -—0O —0°O 


1 
_ (ry pe @/DF-AN (16) 
det[A] 


The role of A in (16) is played in (15) by the differential operator — (8? + m7). 
The defining equation for the inverse A - A~! = J or A; Air = §;, becomes in the 
continuum limit 


(8° + m*) D(x — y) =8O(~% — y) (17) 
We denote the continuum limit of Ay by D(x — y) (which we know must be a 
function of x — y, and not of x and y separately, since no point in spacetime is 
special). Note that in going from the lattice to the continuum Kronecker is replaced 
by Dirac. It is very useful to be able to go back and forth mentally between the 
lattice and the continuum. 
Our final result is 


Z(J) = Ce I/D Jf PxayI@)DE-YIO) — EgiWU) (18) 


with D(x) determined by solving (17). The overall factor ©, which corresponds to 
the overall factor with the determinant in (16), does not depend on J and, as will 
become clear in the discussion to follow, is often of no interest to us. As a rule I 
will omit writing C altogether. Clearly, €C = Z(J = 0) so that W(J) is defined by 


Z(J)=Z(J =0)e'¥M (19) 


Observe that 
WW) = -> | | d*xd*yJ(x)D(x — y)J(y) (20) 


is a simple quadratic functional of J. In contrast, Z(J) depends on arbitrarily high 
powers of J. This fact will be of importance in Chapter I.7. 


Free propagator 


The function D(x), known as the propagator, plays an essential role in quantum 
field theory. As the inverse of a differential operator it is clearly closely related to 
the Green’s function you encountered in a course on electromagnetism. 
Physicists are sloppy about mathematical rigor, but even so, they have to be 
careful once in a while to make sure that what they are doing actually makes sense. 
For the integral in (15) to converge for large g we replace m? —> m* — ig so that 
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the integrand contains a factor e ° f dag? where ¢ is a positive infinitesimal? we 
will let tend to zero later. 

We can solve (17) easily by going to momentum space and recalling the repre- 
sentation of the Dirac delta function 


d*k 
s(x —y)= / On) {ete (21) 


The solution is 
d*k eik@ —y) 


Dx —yy= | —__* ——_ 
— ¥) (271)* k2 — m? + ie 


(22) 
which you can check by plugging into (17). Note that the so-called ie prescription 
we just mentioned is essential; otherwise the & integral would hit a pole. 

To evaluate D(x) we first integrate over k° by the method of contours. De- 
fine @, = +v k2 + m2. The integrand has two poles in the complex k® plane, at 
+, let — ie, which in the e — 0 limit is equal to +a, — ie and —a, + ie. For x? 
positive we can extend the integration contour that goes from —oo to +00 on the 
real axis to include the infinite semicircle in the upper half-plane, thus enclosing 
the pole at —a, + ie and giving —i f[d°k/ (27 )72@, Je! (oxt—k-*)_ For x negative 
we Close the contour in the lower half-plane. Thus 


d>k ; te . 7 
__ —ifayt—k-x) 0 ifayt—k-xyg¢__ 0 
Dww)=-1 / Qn)2e, feo <k A(x") + ee 6(-x")} = (23) 


Physically, D(x) describes the amplitude for a disturbance in the field to prop- 
agate from the origin to x. We expect drastically different behavior depending 
on whether x is inside or outside the lightcone. Without evaluating the inte- 
gral we can see roughly how things go. For x = (¢, 0) with, say, > 0, Dx) = 
—i [[d?k/(27)°2@,Je~'@# is a superposition of plane waves and thus we should 
have oscillatory behavior, In contrast, for x9 =0, we have, upon interpreting 
6(0)=4, Da) =-i f [dk /Qny2v k2 + m?] e ‘k= and the square root cut 
starting at tim leads to an exponential decay ~ e™l#l, as we would expect. Clas- 
sically, a particle cannot get outside the lightcone, but 2 quantum field can “leak” 
out over a distance of the order m~!, 


Exercises 


1.3.1. Verify that D(x) decays exponentially for spacelike separation. 


1.3.2. Work out the propagator D(x) for a free-field theory in (1 + 1)-dimensional 
spacetime and study the large x! behavior for x° = 0. 


3 As is customary, ¢ is treated as generic, so that ¢ multiplied by any positive number is 
still e. 


Chapter 1.4 


From Field to Particle to Force 


From field to particle 


In the previous chapter we obtained for the free theory 
1 
WW) =—5 | | d*xd*y J(x)D(x — y)J(y) (1) 


which we now write in terms of the Fourier transform J (k) = f[ d*xe—*** F(x): 


1 d*k ‘ 1 
WW) = —> One? ome ™ (2) 


[Note that J(k)* = J(—k) for J(x) real.] 

We can jump up and down on the mattress any way we like. In other words, we 
can choose any J/(x) we want, and by exploiting this freedom of choice, we can 
extract a remarkable amount of physics. 

Consider J (x) = J,(x) + Jo(x), where J, (x) and J,(x) are concentrated in two 
local regions 1 and 2 in spacetime (Fig. I.4.1). Then W(/) contains four terms, of 
the form J J,, Jf Jo, Ji‘ Jo, and Jy J,. Let us focus on the last two of these terms, 
one of which reads 


1 
W(J) =—- J (k)*———__—_ 
=~) (204 2) me tie 


J(k) (3) 


We see that W(/) is large only if J;(x) and J>(x) overlap significantly in their 
Fourier transform and if in the region of overlap in momentum space k* — m? 
almost vanishes. There is a “resonance type” spike atk? = m”, thatis, if the energy- 
momentum relation of a particle of mass m is satisfied. (We will use the language of 
the relativistic physicist, writing “momentum space” for energy-momentum space, 
and lapse into nonrelativistic language only when the context demands it, such as 
in “energy-momentum relation.”) 

We thus interpret the physics contained in our simple field theory as follows: 
In region 1 in spacetime there exists a source that sends out a “disturbance in the 
field,” which is later absorbed by a sink in region 2 in spacetime. Experimentalists 
choose to call this disturbance in the field a particle of mass m. Our expectation 
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>| 


Figure L4.1 


based on the equation of motion that the theory contains a particle of mass m is 
fulfilled. 

A bit of jargon: When k* = m?, k is said to be on mass shell. Note, however, 
that in (3) we integrate over all k, including values of & far from the mass shell. 
For arbitrary k, it is a linguistic convenience to say that a “virtual particle” of 
momentum k propagates from the source to the sink. 


From particle to force 


We can now go on to consider other possibilities for J (x) (which we will refer 
to generically as sources), for example, J(x) = J,(x) + Jo(x), where J,(x) = 
§3)(x% — X,). In other words, J(x) is a sum of sources that are time-independent 
infinitely sharp spikes located at x, and x, in space. (If you like more mathematical 
rigor than is offered here, you are welcome to replace the delta function by lumpy 
functions peaking at x,. You would simply clutter up the formulas without gaining 
much.) More picturesquely, we are describing two massive lumps sitting at x, and 
X> on the mattress and not moving at all [no time dependence in J (x)]. 

What do the quantum fluctuations in the field g, that is, the vibrations in the 
mattress, do to the two lumps sitting on the mattress? If you expect an attraction 
between the two lumps, you are quite right. 

As before, W(J) contains four terms. We neglect the “self-interaction” term 
J\ J, since this contribution would be present in W regardless of whether J, is 
present or not. We want to study the interaction between the two “massive lumps” 
represented by J, and J,. Similarly we neglect J; J. 

Plugging into (1) and doing the integral over d>x and d°y we immediately obtain 
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dk 0 -yo [ Bk — el i-*) 
W(J/)=- dx °d of ——_ gik (x—y) ———_—__—_. 4 
) II y Qn (27)? k? — m2 +ie 4) 


(The factor 2 comes from the two terms J>J, and J, J). ) Integrating over y° we 
get a delta function setting &° to zero (so that & is certainly not on mass shell, to 
throw the jargon around a bit). Thus we are left with 


ak elk 4-2) 
WJ) =( | dx® =—— 5 
” (/ *') QxP 2+ m ©) 


Note that the infinitesimal ie can be dropped since the denominator k? + m? is 
always positive. 

The factor (f dx®) should have filled us with fear and trepidation: an integral 
over time, it seems to be infinite. Fear not! Recall that in the path integral formalism 
Z =Ce'"Y) represents (O[ e777 (0) =e "=", where E is the energy due to the 
presence of the two sources acting on each other. The factor (f dx®°) produces 
precisely the time interval 7. All is well. Setting iW =i£T we obtain from (5) 


3, pik-(@|—%) 
g-_| 2ee (6) 
(20)? k2 + m2 


This energy is negative! The presence of two delta function sources, at x, and 
X>, has lowered the energy. In other words, the two sources attract each other by 
virtue of their coupling to the field g . We have derived our first physical result in 
quantum field theory! 

We identify E as the potential energy between two static sources. Even without 
doing the integral we see that as the separation x, — x, between the two sources 
becomes large, the oscillatory exponential cuts off the integral. The characteristic 
distance is the inverse of the characteristic value of k, which is m. Thus, we expect 
the attraction between the two sources to decrease rapidly to zero over the distance 
1/m. 

The range of the attractive force generated by the field g is determined inversely 
by the mass m of the particle described by the field. Got that? 

The integral is done in the appendix to this chapter and gives 


1 
E= “ie? (7) 


The result is as we expected: The potential drops off exponentially over the distance 
scale 1/m. Obviously, dE'/dr > 0: The two massive lumps sitting on the mattress 
can lower the energy by getting closer to each other. 

What we have derived was one of the most celebrated results in twentieth- 
century physics. Yukawa proposed that the attraction between nucleons in the 
atomic nucleus is due to their coupling to a field like the g field described here. 
The known range of the nuclear force enabled him to predict not only the existence 
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of the particle associated with this field, now called the 7 meson! or the pion, but 
its mass as well. As you probably know, the pion was eventually discovered with 
essentially the properties predicted by Yukawa. 


Origin of force 


That the exchange of a particle can produce a force was one of the most profound 
conceptual advances in physics. We now associate a particle with each of the known 
forces: for example, the photon with the electromagnetic force and the graviton 
with the gravitational force; the former is experimentally well established and the 
latter while it has not yet been detected experimentally hardly anyone doubts its 
existence. We will discuss the photon and the graviton in the next chapter, but 
we can already answer a question smart high school students often ask: Why do 
Newton’s gravitational force and Coulomb’s electric force both obey the 1/7? law? 

We see from (7) that if the mass m of the mediating particle vanishes, the force 
produced will obey the 1/r? law. If you trace back over our derivation, you will see 
that this comes about from the fact that the Lagrangian density for the simplest field 
theory involves two powers of the spacetime derivative 0 (since any term involving 
one derivative such as g @¢ is not Lorentz invariant). Indeed, the power dependence 


of the potential follows simply from dimensional analysis: f d3k(ei** /k2) ~ L/r. 


Connected versus disconnected 


We end with a couple of formal remarks of importance to us only in Chapter 1.7. 
First, note that we might want to draw a small picture Fig.(1.4.2) to represent the 
integrand J(x)D(x — y)J(y) in W(J): A disturbance propagates from y to x (or 
vice versa). In fact, this is the beginning of Feynman diagrams! Second, recall that 


Z(J)=Z(J =0) >> ae 


n=0 


For instance, the n = 2 term in Z(/)/Z(J = 0) is given by 


1 >\2 
Ti (-3) fff d'x,d4x)d4*x3d*x4D(x — X>) 
D(x3 — X4) J (x1) F (42) F (3) J (4) 


The integrand is graphically described in Figure 1.4.3. The process is said to be 
disconnected: The propagation from x, to x, and the propagation from x3 to x4 


' The etymology behind this word is quite interesting (A. Zee, Fearful Symmetry: see pp. 
169 and 335 to jearn, among other things, the French objection and the connection between 
meson and illusion). 
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Figure L.4.2 


Figure L4.3 
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proceed independently. We will come back to the difference between connected 
and disconnected in Chapter I.7. 

Appendix 


Writing x = (x; — x») and u =cos6@ with 6 the angle between k and X, we evaluate the 
integral in (6) spherical coordinates (with k = jk| and r = |X|) to be 


eikru 9 co sin kr 
d dkk 8 
=a [oa an "e+ m2 + m (27)*ir k2 + m2 8) 


Since the integrand is even, we can extend the integral and write it as 


1° sin kr 1 {* 1 . 
_ dkk = — dk k ———-e'* 
2 [ ke + m2 2i —oo ke + m2 


Since r is positive, we can close the contour in the upper half-plane and pick up the pole at 
+im, obtaining (1/2i)(27i)(im/2im)e—™ = (1 /2)e—"".. Thus, J = (1/4ar)e~™. 
Exercise 


1.4.1. Calculate the analog of the inverse square law in a (2 + 1)-dimensional universe, 
and more generally in a (D + 1)-dimensional universe. 


Chapter I.5 


Coulomb and Newton: 
Repulsion and Attraction 


Why like charges repel 


We suggested that quantum field theory can explain both Newton’s gravitational 
force and Coulomb’s electric force naturally. Between like objects Newton’s force 
is attractive while Coulomb’s force is repulsive. Is quantum field theory “smart 
enough” to produce this observational fact, one of the most basic in our under- 
standing of the physical universe? You bet! 

We will first treat the quantum field theory of the electromagnetic field, known 
as quantum electrodynamics or QED for short. In order to avoid complications 
at this stage associated with gauge invariance (about which much more later) I 
will consider instead the field theory of a massive spin 1 meson, or vector meson. 
After all, experimentally all we know is an upper bound on the photon mass, 
which although tiny is not mathematically zero. We can adopt a pragmatic attitude: 
Calculate with a photon mass m and set m = 0 at the end, and if the result does 
not blow up in our faces, we will presume that it is OK.! 

Recall Maxwell’s Lagrangian for electromagnetism £ = —jF,,, F*", where 
Fy =9,A, — 0,A,, with A, (x) the vector potential. You can see the reason for 
the important overall minus sign in the Lagrangian by looking at the coefficient 
of (3)A;)”, which has to be positive, just like the coefficient of (49) in the 
Lagrangian for the scalar field. This says simply that time variation should cost 
a positive amount of action. 

I will now give the photon a small mass by changing the Lagrangian to £ = 
—!F,,F#" + 1m?A, A" + A, J". (The mass term is written in analogy to the 
mass term m7” in the scalar field Lagrangian; we will see later that it is indeed 
the photon mass.) I have also added a source J“ (x) ,which in this context is more 
familiarly known as a current. We will assume that the current is conserved so that 
0,J% =0. 


' When I took a field theory course as a student with Sidney Coleman this was how he 
treated QED in order to avoid discussing gauge invariance. 
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Well, you know that the field theory of our vector meson is defined by the path 
integral Z = f DA eS“ = e!¥/) with the action 


S(A) = J dxh= J d’x{5A,,[(0° + m)g#” — a4a"JA, + A,J*} (1) 


The second equality follows upon integrating by parts [compare (1.3.15)]. 

By now you have learned that we simply apply (1.3.16). We merely have to find 
the inverse of the differential operator in the square bracket; in other words, we 
have to solve 


[(? + m?)g@” — a¥ 9" ID, (x) = 878 (x) (2) 
As before [compare (1.3.17)] we go to momentum space by defining 


d*k 


ikx 
(Qn 4 Dy) (k)e 


D(x) = 


Plugging in, we find that [—(k? — m?)g#” + k#k’ JD, (k) = 6”, giving 


=8y) + kyk,/m? 


(3) 
This is the photon, or more accurately the massive vector meson, propagator. Thus 


1 d4 —gv tk, ky/m? 
W(J)=-- A pet Sue / 


—__—____]” 4 
2J (2Qn)4 k? — m2 +ie 7 oe 


Since current conservation 0,,J“(x) = 0 gets translated into momentum space 
as kJ" (k) =0, we can throw away the k,,k, term in the photon propagator. The 
effective action simplifies to 

4 
1 d'k TH 


1 
OFF fem O ner ° 


No further computation is needed to obtain a profound result. Just compare this 
result to (1.4.2). The field theory has produced an extra sign. The potential energy 
between two lumps of charge density J°(x) is positive. The electromagnetic force 
between like charges is repulsive! 

We can now safely let the photon mass m go to zero thanks to current conser- 
vation, [Note that we could not have done that in (3).] Indeed, referring to (1.4.7) 
we see that the potential energy between like charges is 


E=—e™ > — (6) 


To accommodate positive and negative charges we can simply write J4 = 
J P — J, We see that a lump with charge density J > is attracted to a lump with 


charge density J, 
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Bypassing Maxwell 


Having done electromagnetism in two minutes flat let me now do gravity. Let us 
move on to the massive spin 2 meson field. In my treatment of the massive spin 1 
meson field I took a short cut. Assuming that you are familiar with the Maxwell 
Lagrangian, I simply added a mass term to it and took off. But I do not feel com- 
fortable assuming that you are equally familiar with the corresponding Lagrangian 
for the massless spin 2 field (the so-called linearized Einstein Lagrangian, which 
I will discuss in a later chapter). So here I will follow another strategy. 

I invite you to think physically, and together we will arrive at the propagator 
for a massive spin 2 field. First, we will warm up with the massive spin 1 case. 

In fact, start with something even easier: the propagator D(k) = 1/(k* — m7) 
for a massive spin 0 field. It tells us that the amplitude for the propagation of a 
spin 0 disturbance blows up when the disturbance is almost a real particle. The 
residue of the pole is a property of the particle. The propagator for a spin 1 field 
D,,, carries a pair of Lorentz indices and in fact we know what it is from (3): 


—G 
Dy lk) = 7 (7) 
where for later convenience we have defined 
kk 
Gin) = 8 — > (8) 


Let us now understand the physics behind G,, . I expect you to remember the 
concept of polarization from your course on electromagnetism. A massive spin 
1 particle has three degrees of polarization for the obvious reason that in its rest 
frame its spin vector can point in three different directions. The three polarization 
vectors e(@) are simply the three unit vectors pointing along the x, y, and z axes, 
respectively (a = 1, 2, 3): e(? =(0, 1, 0, 0), e?) = (0,0, 1, 0), e® = (0, 0,0, 2). 
In the rest frame k* = (m, 0, 0, 0) and so 


(a) _ 
kKes =0 (9) 


Since this is a Lorentz invariant equation, it holds for a moving spin 1 particle as 
well. Indeed, with a suitable normalization condition this fixes the three polariza- 
tion vectors e(k) for a particle with momentum k. 

The amplitude for a particle with momentum k and polarization a to be created 
at the source is proportional to e (k), and the amplitude for it to be absorbed 
at the sink is proportional to 64) (k). We multiply the amplitudes together to get 
the amplitude for propagation from source to sink, and then sum over the three 
possible polarizations. Now we understand the residue of the pole in the spin | 
propagator D,,, (k): It represents >°, 6) (k) 6 (k) . To calculate this quantity, 
note that by Lorentz invariance it can only be a linear combination of g,, and k,k,. 
The condition k“e™ = 0 fixes the combination as g,, — k,k,/m”. We evaluate 
the left-hand side for k& at rest with vy = A = 1, for instance, and fix the overall and 
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all-crucial sign to be —1. Thus 


a ky 
- eM (ket (k) = —G13(k) =— («0 _ a) (10) 


a 


We have thus constructed the propagator D,,,(k) for a massive spin 1 particle, 
bypassing Maxwell. Onward to spin 2! We want to similarly bypass Einstein. 


Bypassing Einstein 


A massive spin 2 particle has 5 (2-2 + 1=5, remember?) degrees of polarization, 
characterized by the five polarization tensors e(@) (a=1,2,---, 5) symmetric in 
the indices y and v satisfying 


k#e @) =0 (11) 
and the tracelessness condition 
girl) =0 (12) 


Let’s count as a check. A symmetric Lorentz tensor has 4 - 5/2 = 10 components. 
The four conditions in (11) and the single condition in (12) cut the number of com- 
ponents down to 10 — 4 — 1=5, precisely the right number. (Just to throw some 
jargon around, remember how to construct irreducible group representations? If 
not, read Appendix C.) We fix the normalization of ¢,,, by setting the positive 
quantity >", 6) (ke ()=1 

So, in analogy with the spin 1 case we now determine )~, a) (Ke (k). We 
have to construct this object out of g,,, and k,,, or equivalently G,,,, and k,,. This 


quantity must be a linear combination of terms such as G,,,G,,, G,, kjk, and 
so forth. Using (11) and (12) repeatedly (Exercise 1.5.1) you will easily find that 


» en) (Ke? (k) = (Gui Gio + GyoGy,) ~~ 5G wwGio (13) 
a 


The overall sign and proportionality constant are determined by evaluating both 
sides for 2 = A = land v =o =2, for instance. 
Thus, we have determined the propagator for a massive spin 2 particle 


(GyirGve + GyoGyy) — GG yy Gig 


aS (14) 


Divs aa (k) = 


Why we fall 


We are now ready to understand one of the fundamental mysteries of the universe: 
Why masses attract. 
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Recall from your courses on electromagnetism and special relativity that the 
energy or mass density out of which mass is composed is part of a stress-energy 
tensor T“”. For our purposes, in fact, all you need to remember is that it is a 
symmetric tensor and that the component T is the energy density. 

To couple to the stress-energy tensor, we need a tensor field g,,,, symmetric in 
its two indices. In other words, the Lagrangian of the world should contain a term 
like y,,,, 7%”. This is in fact how we know that the graviton, the particle responsible 
for gravity, has spin 2, just as we know that the photon, the particle responsible for 
electromagnetism and hence coupled to the current J“, has spin 1. In Einstein’s 
theory, which we will discuss in a later chapter, ¢,,,, is of course part of the metric 
tensor. 

Just as we pretended that the photon has a smal! mass to avoid having to discuss 
gauge invariance, we will pretend that the graviton has a small mass to avoid having 
to discuss general coordinate invariance.” Aha, we just found the propagator for a 
massive spin 2 particle. So let’s put it to work. 

In precise analogy to (4) 


aK ye Suv tk k,/m? 
Gx Om +ie © a>) 


describing the interaction between two electromagnetic currents, the interaction 
between two lumps of stress energy is described by 


W(T) 
_ 1 d*k pi pyr GurGve + GyoGyy) _ §GuvGio T*? (k) (16) 
2J (2nr)4 k? — m2 +ie 


From the conservation of energy and momentum 0,7""(x) =0 and hence 
k,T¥*(k) =0, we can replace G,,, in (16) by g,,,. 

Now comes the punchline. Look at the interaction between two lumps of energy 
density T°. We have from (16) that 


1 f d*k 7p jtta5 00 
Ww - 1 
| ane & “Pom+ie © CD 


Comparing with (5) and using the well-known fact that (1+ 1— 3) > 0, we see 
that while like charges repel, masses attract. Trumpets, please! 


? For the moment, I ask you to ignore all subtleties and simply assume that in order to 
understand gravity it is kosher to let m > 0. I will give a precise discussion of Einstein’s 
theory of gravity in Chapter VIII.1. 
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The universe 


It is difficult to overstate the importance (not to speak of the beauty) of what we 
have learned: The exchange of a spin 0 particle produces an attractive force, of 
a spin | particle a repulsive force, and of a spin 2 particle an attractive force, 
realized in the hadronic strong interaction, the electromagnetic interaction, and the 
gravitational interaction, respectively. The universal attraction of gravity produces 
an instability that drives the formation of structure in the early universe.* Denser 
regions become denser yet. The attractive nuclear force mediated by the spin 
O particle eventually ignites the stars. Furthermore, the attractive force between 
protons and neutrons mediated by the spin 0 particle is able to overcome the 
repulsive electric force between protons mediated by the spin 1 particle to form a 
variety of nuclei without which the world would certainly be rather boring. The 
repulsion between likes and hence attraction between opposites generated by the 
spin 1 particle allow electrically neutral atoms to form. 

The world results from a subtle interplay among spin 0, 1, and 2. 

In this lightning tour of the universe, we did not mention the weak interaction. 
In fact, the weak interaction plays a crucial role in keeping stars such as our sun 
burning at a steady rate. 


Degrees of freedom 


Now for a bit of cold water: Logically and mathematically the physics of a particle 
with mass m # 0 could well be different from the physics with m = 0. Indeed, 
we know from classical electromagnetism that an electromagnetic wave has 2 
polarizations, that is, 2 degrees of freedom. For a massive spin 1 particle we can go 
to its rest frame, where the rotation group tells us that there are 2-1 + 1 = 3 degrees 
of freedom. The crucial piece of physics is that we can never bring the massless 
photon to its rest frame. Mathematically, the rotation group S O (3) degenerates into 
SO(2), the group of 2-dimensional rotations around the direction of the photon’s 
momentum. 

We will see in Chapter II.7 that the longitudinal degree of freedom of a massive 
spin 1 meson decouples as we take the mass to zero. The treatment given here 
for the interaction between charges (6) is correct. However, in the case of gravity, 
the 7 in (17) is replaced by 1 in Einstein’s theory, as we will see Chapter VIII.1. 
Fortunately, the sign of the interaction given in (17) does not change. Mute the 
trumpets a bit. 


3A good place to read about gravitational instability and the formation of structure in 
the universe along the line sketched here is in A. Zee, Einstein’s Universe (formerly known 
as An Old Man’s Toy). 
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Appendix 


Pretend that we never heard of the Maxwell Lagrangian. We want to construct a relativistic 
Lagrangian for a massive spin 1 meson field. Together we will discover Maxwell. Spin 
1 means that the field transforms as a vector under the 3-dimensional rotation group. 
The simplest Lorentz object that contains the 3-dimensional vector is obviously the 4- 
dimensional vector. Thus, we start with a vector field A, (x). 

That the vector field carries mass m means that it satisfies the field equation 


(8 +m’)A, =0 (18) 


A spin 1 particle has 3 degrees of freedom [remember, in fancy language, the representation 
j of the rotation group has dimension (27 + 1); here j = 1.] On the other hand, the field 
A,,(x) contains 4 components. Thus, we must impose a constraint to cut down the number 
of degrees from 4 to 3. The only Lorentz covariant possibility (linear in A,,) is 


3,4" =0 (19) 


It may also be helpful to look at (18) and (19) in momentum space, where they read 
(k? — m?)A, (k) = 0 and k, A“(k) = 0. The first equation tells us that k” = m* and the 
second that if we go to the rest frame k* = (m, 0) then A° vanishes, leaving us with 3 
nonzero components A’ with i = 1, 2, 3. 

The remarkable observation is that we can combine (18) and (19) into a single equation, 
namely 


(gta? — a#a")A, + mA“ =0 (20) 


Verify that (20) contains both (18) and (19). Act with 3,, on (20). We obtain m?a,, A“ =0, 
which implies that 3,,A“ =0 . (At this step it is crucial that m # 0 and that we are not 
talking about the strictly massless photon.) We have thus obtained (19 ); using (19) in (20) 
we recover (18). 

We can now construct a Lagrangian by multiplying the left-hand side of (20) by +34, 
(the 4 is conventional but the plus sign is fixed by physics, namely the requirement of 
positive kinetic energy); thus 


£ = 5A,[(87 + m*)gh” — atarJA, (21) 


Integrating by parts, we recognize this as the massive version of the Maxwell Lagrangian. 
In the limit m — 0 we recover Maxwell. 

A word about terminology: Some people insist on calling only F,,, a field and A,, a 
potential. Conforming to common usage, we will not make this fine distinction. For us, any 
dynamical function of spacetime is a field. 


Exercises 


L5.1. Write down the most general form for }°, ce) (ke? (k) using symmetry repeat- 
edly. For example, it must be invariant under the exchange {jzv <> Aa}. You might 
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end up with something like 
AG, Gig + BIG yi Gye + GysGyy) + C(Gyyky ko + kk, Gig) 
+ Dky ky. Gye + hyko Gy + kiko Gyan + kyki Guo) + Ekykyky ky (22) 


with various unknown A, ---, E. Apply k” 7, €(k)e\2(k) = 0 and find out 
what that implies for the constants. Proceeding in this way, derive (13). 


Chapter I.6 


Inverse Square Law and the 
Floating 3-Brane 


Why inverse square? 


In your first encounter with physics, didn’t you wonder why an inverse square 
force law and not, say, an inverse cube law? You now have the deep answer. When a 
massless particle is exchanged between two particles, the potential energy between 
the two particles goes as 


V(r) « J dk FF + (1) 
ke or 
The spin of the exchanged particle controls the overall sign, but the 1/r follows 
just from dimensional analysis, as I remarked earlier. Basically, V (r) is the Fourier 
transform of the propagator. The K in the propagator comes from the (d;¢ - 
d;¢) term in the action, where g denotes generically the field associated with 
the massless particle being exchanged, and the (d;y - 0;¢) form is required by 
rotational invariance. It couldn’t be & or K? in (1); k? is the simplest possibility. 
So you can say that in some sense ultimately the inverse square law comes from 
rotational invariance! 
Physically, the inverse square law goes back to Faraday’s flux picture. Consider 
a sphere of radius r surrounding a charge. The electric flux per unit area going 
through the sphere varies as 1/4r. This geometric fact is reflected in the factor 
d°k in (1). 


Brane world 


Remarkably, with the tiny bit of quantum field theory I have exposed you to, I can 
already take you to the frontier of current research, current as of the writing of this 
book. In string theory, our (3 + 1)-dimensional world could well be embedded in 
a larger universe, the way a (2 + 1)-dimensional sheet of paper is embedded in our 
everyday (3 + 1)-dimensional world. We are said to be living on a 3 brane. 

So suppose there are n extra dimensions, with coordinates x4 x ott xnt3, 
Let the characteristic scales associated with these extra coordinates be R. I can’t 
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go into the different detailed scenarios describing what R is precisely. For some 
reason I can’t go into either, we are stuck on the 3 brane. In contrast, the graviton 
is associated intrinsically with the structure of spacetime and so roams throughout 
the (n + 3 + 1)-dimensional universe. 
All right, what is the gravitational force law between two particles? It is surely 
not your grandfather’s gravitational force law: We Fourier transform 
iz l 1 


34n, ike + 
via fd ke 72 % pir 


(2) 


to obtain a 1/r!*” law. 

Doesn’t this immediately contradict observation? 

Well, no, because Newton’s law continues to hold for r >> R. In this regime, 
the extra coordinates are effectively zero compared to the characteristic length 
scale r we are interested in. The flux cannot spread far in the direction of the n 
extra coordinates. Think of the flux being forced to spread in only the three spatial 
directions we know, just as the electromagnetic field in a wave guide is forced to 
propagate down the tube. Effectively we are back in (3 + 1)-dimensional spacetime 
and V(r) reverts to a 1/r dependence. 

The new law of gravity (2) holds only in the opposite regime r < R. Heuris- 
tically, when R is much larger than the separation between the two particles, the 
flux does not know that the extra coordinates are finite in extent and thinks that it 
lives in an (n + 3 + 1)-dimensional universe. 

Because of the weakness of gravity, Newton’s force law has not been tested 
to much accuracy at laboratory distance scales, and so there is plenty of room for 
theorists to speculate in: R could easily be much larger than the scale of elementary 
particles and yet much smaller than the scale of everyday phenomena. Incredibly, 
the universe could have “large extra dimensions”! (The word “large” means large 
on the scale of particle physics.) 


Planck mass 


To be quantitative, let us define the Planck mass Mp, by writing Newton’s law more 
rationally as V(r) = Gym m)(1/r) = (mymo/M3,)(1/r). Numerically, Mp, ~ 
10!° Gev. This enormous value obviously reflects the weakness of gravity. 

In fundamental units in which # and c are set to unity, gravity defines an 
intrinsic mass or energy scale much higher than any scale we have yet explored 
experimentally. Indeed, one of the fundamental mysteries of contemporary particle 
physics is why this mass scale is so high compared to anything else we know of. I 
will come back to this so-called hierarchy problem in due time. For the moment, 
let us ask if this new picture of gravity, new in the waning moments of the last 
century, can alleviate the hierarchy problem by lowering the intrinsic mass scale 
of gravity. 

Denote the mass scale characteristic of gravity in the (n + 3 + 1)-dimensional 
universe by Mpy,4.341) So that the gravitational potential between two objects of 
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masses m, and m» separated by a distance r < R is given by 


MyMy 1 


4 (r) a rn plin 


[Mpi(n4341) 


Note that the dependence on Mpi,341) follows from dimensional analysis: two 
powers to cancel m,m, and n powers to match the n extra powers of 1/r. For 
r > R, as we have argued, the geometric spread of the gravitational flux is cut off 
by R so that the potential becomes 


mim ii 


Vo) =m 
[Meyna3-prt" R'r 


Comparing with the observed law V (r) = (mm>/ M3)(1/ r) we obtain 
Mp 


M2 = ———Fl_ 3 
Plint3+0 ~ | Mouns3iyRT (3) 


If Mpin4341)R can be make large enough, we have the intriguing possibility that 
the fundamental scale of gravity Mpy,4341) is much lower than what we have 
always thought. 

Thus, R is bounded on one side by our desire to lower the fundamental scale 
of gravity and on the other by experiments. 


Exercise 


1.6.1. Putting in the numbers show that the case m = 1 is already ruled out. For help, see 
S. Nussinov and R. Schrock, Phys. Rev. D59: 105002, 1999, 


Chapter I.7 


Feynman Diagrams 


Feynman brought quantum field theory to the masses. 


—J. Schwinger 


Anharmonicity in field theory 


The free field theory we studied in the last few chapters was easy to solve because 
the defining path integral (1.3.14) is Gaussian, so we could simply apply (1.2.15). 
(This corresponds to solving the harmonic oscillator in quantum mechanics.) As 
I noted in Chapter I.3, within the harmonic approximation the vibrational modes 
on the mattress can be linearly superposed and thus they simply pass through each 
other. The particles represented by wave packets constructed out of these modes 
do not interact:' hence the term free field theory. To have the modes scatter off 
each other we have to include anharmonic terms in the Lagrangian so that the 
equation of motion is no longer linear. For the sake of simplicity let us add only 
one anharmonic term — io? to our free field theory and, recalling (1.3.11), try to 
evaluate 


: 4.4 2 p22] A gt 
za) = [ De oi f Bxthl@—)—me?]- dott 79) (1) 


(We suppress the dependence of Z on A.) 

Doing quantum field theory is no sweat, you say, it just amounts to doing the 
functional integral (1). But the integral is not easy! If you could do it, it would be 
big news. 


1 A potential source of confusion: Thanks to the propagation of g, the sources coupled 
to g interact, as was seen in Chapter [.4, but the particles associated with g do not interact 
with each other. This is like saying that charged particles coupled to the photon interact, but 
(to leading approximation) photons do not interact with each other. 
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Feynman diagrams made easy 


As an undergraduate, I heard of these mysterious little pictures called Feynman 
diagrams and really wanted to learn about them. I am sure that you too have 
wondered about those funny diagrams. Well, I want to show you that Feynman 
diagrams are not such a big deal: Indeed we have already drawn little spacetime 
pictures in Chapters I.3 and 1.4 showing how particles can appear, propagate, and 
disappear. 

Feynman diagrams have long posed somewhat of an obstacle for first-time 
learners of quantum field theory. To derive Feynman diagrams, traditional texts 
typically adopt the canonical formalism (which I will introduce in the next chapter) 
instead of the path integral formalism used here. As we will see, in the canonical 
formalism fields appear as quantum operators. To derive Feynman diagrams, we 
would have to solve the equation of motion of the field operators perturbatively in 
i. A formidable amount of machinery has to be developed. 

In the opinion of those who prefer the path integral, the path integral formalism 
derivation is considerably simpler (naturally!). Nevertheless, the derivation can 
still get rather involved and the student could easily lose sight of the forest for the 
trees. There is no getting around the fact that you would have to put in some effort. 

I will try to make it as easy as possible for you. I have hit upon the great 
pedagogical device of letting you discover the Feynman diagrams for yourself. 
My strategy is to let you tackle two problems of increasing difficulty, what I call 
the baby problem and the child problem. By the time you get through these, the 
problem of evaluating (1) will seem much more tractable. 


A baby problem 


The baby problem is to evaluate the ordinary integral 


+00 
ZI) =f dge tne —etvie 2) 


—& 


evidently a much simpler version of (1). 

First, a trivial point: we can always scale g — q/m so that Z =m 'F( 
but we won’t. 

For 4 = 0 this is just one of the Gaussian integrals done in the appendix of 
chapter 1.2. Well, you say, it is easy enough to calculate Z(J/) as a series in A: 
expand 


A. J 
ma? ae 


+00 L272 a x 
Z(J) = | dge 2" +2 1 — gt + 5S Ge +: | 
-_ 4! 2"! 
and integrate term by term. You probably even know one of several tricks for com- 
2 
puting f° dqe~2™ P +59 gan. you write it as (4) fr 
dqei™ 7 +54 and refer to (1.2.11). So 
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J J J J J J 
r r * r 
J J J J J J 
(a) (b) (c) 
(a2)*0J4 
Figure 1.7.1 


Aa 4 1 A.4,d 8 [~ —dm2g24s 
AN=1~ 2S 2 (4)2(4 cae dge 27 tla 3 
(J) = ( nap +37'5a GP + yy. qe (3) 

+00 
=e hid dge Ime t0 = (Ay he BEM ein! (4) 
—OO 


(There are other tricks, such as differentiating [ ree dqe- yma’ +J4 with respect 
to m? repeatedly, but I want to discuss a trick that will also work for field the- 
ory.) By expanding the two exponentials we can obtain any term in a double 
series expansion of Z(J) in A and J. [We will often suppress the overall factor 
(22 /m?)? = Z(J =0, A = 0) = Z(0, 0) since it will be common to all terms. 
When we want to be precise, we will define Z=Z(J) /Z(0, 0).] 

For example, suppose we want the term of order A and J4 in Z. We extract 
the order J® term in e//2””, namely, [1/4!(2m)*]J®, replace e~@/40@/44 y 
by —(/4)(d/dJ)*, and differentiate to get [8!(—A)/(4!)°(2m?)*]J4. Another 
example: the term of order A? and J® is 10/4)? (d/dJ)8[1/7!(m?)7]7 4 = 
[14 —A)?/(4!)76!7!2(2m)7]/°. A third example: The term of order A* and J4 
is [12!(—A)*/(4)731(2m)°]J4. Finally, the term of order A and J° is [1/2(2m?)?] 
(—A). 

You can do this as well as Ican! Do a few more and you will soon see a pattern. In 
fact, you will eventually realize that you can associate diagrams with each term and 
codify some rules. Our four examples are associated with the diagrams in Figures 
I.7.1-1.7.4, respectively. You can see, for a reason you will soon understand, that 
each term can be associated with several diagrams. I leave you to work out the rules 
carefully to get the numerical factors right (but trust me, the “future of democracy” 
is not going to depend on them). The rules go something like this: (1) diagrams 
are made of lines and vertices at which four lines meet; (2) for each vertex assign 
a factor of (—A); (3) for each line assign 1/m?; and (4) for each external end 
assign J (e.g., Figure I.7.2 has seven lines, two vertices, and six ends, giving 
~ [(—A)?/(m?)7]/°.) (Did you notice that twice the number of lines is equal to 
four times the number of vertices plus the number of ends? We will meet relations 
like that in Chapter III.2.) 
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IB le II 


(s2)72,2)6 


(d) 


Figure 1.7.2 


For obvious reasons, some diagrams (e.g., Figure I.7.1a, I.7.2a) are known as 
tree” diagrams and others (e.g., Figs. I.7.1b and I.7.3a) as loop diagrams. 

Do as many examples as you need until you feel thoroughly familiar with what 
is going on, because we are going to do exactly the same thing in quantum field 
theory. It will look much messier, but only superficially. Be sure you understand 
how to use diagrams to represent the double series expansion of Z(J) before 
reading on. Please. In my experience teaching, students who have not thoroughly 
understood the expansion of Z(J) have no hope of understanding what we are 
going to do in the field theory context. 


Wick contraction 


It is more obvious than obvious that we can expand Z(J) in powers of J, if we 
please, instead of in powers of A. As you will see, particle physicists like to classify 
in power of J. In our baby problem, we can write 


— 1 +00 Lin2g?—(a./4Ne4 cana | 
Z(J) = “7 | dge 2™ q°—(A/4Dq q° = ZO, 0) = Fe" (5) 
s=0 ~~ s=0 °° 


? The Chinese character for tree (A. Zee, Swallowing Clouds) is shown in Fig. 1.7.5. I 
leave it to you to figure out why this diagram does not appear in our Z(J). 
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e Xe | 


(g) 
(JO 


Figure 1.7.3 


The coefficient G), whose analogs are known as “Green’s functions” in field 
theory, can be evaluated as a series in A with each term determined by Wick 
contraction (1.2.10). For instance, the O(A) term in G“) is 


1 +00 1.22 
—e —sxmiq? 8 _ 8 ** 
ay? A) [. dge mt qa 


which of course better be equal? to what we obtained above for the AJ* term in Z. 
Thus, there are two ways of computing Z : you expand in A first or you expand in 
J first. 


Figure 1.7.4 


3 As a check on the laws of arithmetic we verify that indeed 7!!/(4!)? = 81/(41724. 
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Figure 1.7.5 


Connected versus disconnected 


You will have noticed that some Feynman diagrams are connected and others are 
not. Thus, Figure I.7.2a is connected while 2b is not. I presaged this at the end of 
Chapter I.4 and in Figures I.4.2 and 1.4.3. Write 


Z(J, A) = Z(J =0, Ne™Y = ZU =0,A) > (WU, yn 6) 
N=0 . 


By definition, Z(J =0, A) consists of those diagrams with no external source 
J, such as the one in Figure 1.7.4. The statement is that W is a sum of con- 
nected diagrams while Z contains connected as well as disconnected diagrams. 
Thus, Figure I.7.2b consists of two disconnected pieces and comes from the term 
(1/2) [W (J, )F in (6), the 2! taking into account that it does not matter which of 
the two pieces you put “on the left or on the right.” Similarly, Figure I.7.2c comes 
from (1/3!)[W(J, A)P. Thus, it is W that we want to calculate, not Z. If you’ve 
had a good course on statistical mechanics, you will recognize that this business 
of connected graphs versus disconnected graphs is just what underlies the relation 
between free energy and the partition function. 


Propagation: from here to there 
All these features of the baby problem are structurally the same as the correspond- 
ing features of field theory and we can take over the discussion almost immediately. 


But before we graduate to field theory, let us consider what I call a child problem, 
the evaluation of a multiple integral instead of a single integral: 


+00 f+co +0o Lo wahates 
Z(J) =| | 1 | dqidqy -+-dqy e 2% q—Xl4g’ +59 (7) 
—Co —x —x 


with gt=>-, qi. Generalizing the steps leading to (3) we obtain 
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Z(J) = jenny’ e AlAD 28/5) FANS (8) 
det[A] 


Alternatively, just as in (5) we can expand in powers of J 


Z(J) = y+ Ji = [ [ 4a) e244 @/a%g, 
= i i, qi} qi, qi, 
~o \A, 


s—o * 


= Z(0, 0) Li SiGe (9) 


which again we can expand in powers of A and evaluate by Wick contracting. 

The one feature the child problem has that the baby problem doesn’t is prop- 
agation “from here to there”. Recall the discussion of the propagator in Chapter 
1.3. Just as in (1.3.16) we can think of the index i as labeling the sites on a lattice. 
Indeed, in (1.3.16) we had in effect evaluated the “2-point Green’s function” Go? 
to zeroth order in A (differentiate (1.3.16) with respect to J twice): 


+00 
Ga — 0) = | (11 in) Mag /Z(0, 0) = (A); 
~OO } 


(see also the appendix to Chapter 1.2). The matrix element (A~'), ; describes 

propagation from i to 7. In the baby problem, each term in the expansion of Z(J) 

can be associated with several diagrams but that is no longer true with propagation. 
Let us now evaluate the “4-point Green’s function” GY , to order A: 


Gi = [- (ne )e WAda.g angi 2 4 2% + O(4”) } /Z(0, 0) 
= (AN) (A) + (A D(A) jp + (ADA) jp 


<A DAY) in(A7) jn(A Den(A Din + OC?) (10) 


The first three terms describe one excitation propagating from i to j and another 
propagating from k to /, plus the two possible permutations on this “history.” The 
order A term tells us that four excitations, propagating from i to n, from j to 7, 
from k ton, and from / to n, meet at n and interact with an amplitude proportional 
to A, where n is anywhere on the lattice or mattress. By the way, you also see why 
it is convenient to define the interaction (4./4!)g* with a 1/4!: g; has a choice of 
four q,’s to contract with, q, has three q,,’s to contract with, and so on, producing 
a factor of 4! to cancel the (1/41). 
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Perturbative field theory 


You should now be ready for field theory! 
Indeed, the functional integral in (1) (which I repeat here) 


. 4 
Z(J)= J Doel f Bxt3109)?—-m79"1-0/40 04479) (11) 


has the same form as the ordinary integral in (2) and the multiple integral in (7). 
There is one minor difference: there is no i in (2) and (7), but as I noted in Chapter 
I.2 we can Wick rotate (11) and get rid of the i, but we won’t. The significant 
difference is that J and gy in (11) are functions of a continuous variable x, while 
J and q in (2) are not functions of anything and in (7) are functions of a discrete 
variable. Aside from that, everything goes through the same way. 

As in (3) and (8) we have 


ZI) = Z(0, Oe O44 f Pwbs/ireo | Doei f #x(310)?—mg7-J9) 


= Z(0 Oye C/A f Auta sins wyt --G/2) [ff dxaty.1@)Da-y)IQ) 42) 


The structural similarity is total. 
The role of 1/m? in (3) and of A~! (8) is now played by the propagator 


4 ik-(x—y) 
De -y)= | 48 
(277)* k* —~m* +ie 
Incidentally, if you go back to Chapter 1.3 you will see that if we were in d- 
dimensional spacetime, D(x — y) would be given by the same expression with 
d*k/(27)* replaced by d?k/(2sr)?. The ordinary integral (2) is like a field theory 
in 0-dimensional spacetime: if we set d = 0, there is no propagating around and 
D(x — y) collapses to —1/m*. You see that it all makes sense. 

We also know that J/(x) corresponds to sources and sinks. Thus, if we expand 
Z(J) as a series in J, the powers of J would indicate the number of particles 
involved in the process. (Note that in this nomenclature the scattering process 
yg +g¢— + ¢ counts as a 4-particle process: we count the total number of 
incoming and outgoing particles.) Thus, in particle physics it often makes sense to 
specify the power of J. Exactly as in the baby and child problems, we can expand 
in J first: 


= 1 
Z(J) = Z(0,0) > — J) F(x) GO (x, 5 Xp) 
s=Q * 


~ 1 are | 2_ 2,2 4 
rf d'x{4lag)*— —{a /4! 
=D sey Is) f Dod / x{4[ag)?—-m?g?}-G san} 


s=0 -° 


p(x) --- e(;) (13) 
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In particular, we have the 2-point Green’s function 


1 
Z(0, 0) 


; 4 1 2 221 4 
Ga, %) = | Do ei { PH(31G 0-91-00 Oe o(x5) (14) 


the 4-point Green’s function, 


1 if d4xth 2 mo 4 
G (x1, Xp X4%4) = [ Pod festieo mg*|-(2/4D94) 


Z(0, 0) 


P(X) (Xz) 9(X3)9(X4) (15) 


and so on. [Sometimes Z(J) is called the generating functional as it generates 
the Green’s functions.] Obviously, by translation invariance, G(x), x2) does not 
depend on x, and x2 separately, but only on x; — xz. Similarly, G(x 1, x2, x3, x4) 
only depends on x; — x4, X2 ~- X4, and x3 — x4. For 1 =0, G(x), x2) reduces to 
i D(x, — x2), the propagator introduced in Chapter 1.3. While D(x, — x2) describes 
the propagation of a particle between x, and x, in the absence of interaction, 
G(x, — Xz) describes the propagation of a particle between x, and x» in the 
presence of interaction. If you understood our discussion of GY p you would know 
that G(x,, X2, X3, X4) describes the scattering of particles. 

In some sense, there are two ways of doing field theory, what I might call the 
Schwinger way (12) or the Wick way (13). 

Thus, to summarize, Feynman diagrams are just an extremely convenient way 
of representing the terms in a double series expansion of Z(J) in A and J. 

As I said in the preface, I have no intention of tuming you into a whiz at 
calculating Feynman diagrams. In any case, that can only come with practice. 
Besides, there are excellent texts devoted to the evaluation of diagrams. Instead, I 
tried to give you as clear an account as I can muster of the concept behind this 
marvellous invention of Feynman’s, which as Schwinger noted rather bitterly, 
enables almost anybody to become a field theorist. For the moment, don’t worry 
too much about factors of 4! and 2! 


Collision between particles 


As I already mentioned, I described in chapter 1.4 the strategy of setting up sources 
and sinks to watch the propagation of a particle (which I will call a meson) 
associated with the field g. Let us now set up two sources and two sinks to watch 
two mesons scatter off each other. The setup is shown in Figure 1.7.6. The sources 
localized in regions | and 2 both produce a meson, and the two mesons eventually 
disappear into the sinks localized in regions 3 and 4. It clearly suffices to find in Z 
a term containing J (x,)J (x2) J (x3) J (x4). But this is just G(x, x2, x3, x4). 

Let us be content with first order in 1. Going the Wick way we have to evaluate 
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| 


Figure 1.7.6 


(-=) | d‘w | Dg ef f P#(Z19)?—m¢7) 
Z(0, 0) 4) 


o(x Pox) (x3) (x4) 9(w)4 (16) 


Just as in (10) we Wick contract and obtain 
(-iA) | d*wD(x, ~ w) D(x, — w) D(x; — w)D(x4 — w) (17) 


As a check, let us also derive this the Schwinger way. Replace e~‘/4! 
f d4w(3/5s(w)y4 by (i/4DA f d4w(3/57(w))* and e~ ff dxdt y I) Dix-y) IVY) 
by 


444 _ 
= =| [f axa*y 1000 nso] 


To save writing, it would be sagacious to introduce the abbreviations J, for J(x,), 
f for f d*x,, and D,, for D(x, ~ xp). Dropping overall numerical factors, which 
I invite you to fill in, we obtain 


~ia fo PLL DaePop DegDanFaIpJeFaJeI IgT_, (18) 


The four (6/5 J,,)’s hit the eight J’s in all possible combinations producing many 
terms, which again I invite you to write out. Two of the three terms are discon- 
nected. The connected term is 


I.7. Feynman Diagrams 51 


x3 X4 


(a) (b) 


Figure 1.7.7 


~a f fff DawPowPcwPawIaIoIeFa (19) 
w 


Evidently, this comes from the four (6/8J,,)’s hitting J,, J;, Jy, and J, thus 
setting X,,X/,Xg, and x, to w. Compare (19) with [8!(—A)/(41?(2m7)*]J* in 
the baby problem. 

Recall that we embarked on this calculation in order to produce two mesons by 
sources localized in regions 1 and 2, watch them scatter off each other, and then 
get rid of them with sinks localized in regions 3 and 4. In other words, we set the 
source function J (x) equal to a set of delta functions peaked at x), x), x3, and 
x4. This can be immediately read off from (19): the scattering amplitude is just 
—id f, DiwPrwDP3w Daw. exactly as in (17). 

The result is very easy to understand (see Figure I.7.7a). Two mesons propagate 
from their birthplaces at x, and x, to the spacetime point w, with amplitude 
D(x, — w)D(x2 — w), scatter with amplitude —iA, and then propagate from w 
to their graves at x3 and x4, with amplitude D(w — x3)D(w — xq) [note that 
D(x) = D(—x)]. The integration over w just says that the interaction point w 
could have been anywhere. Everything is as in the child problem. 

It is really pretty simple once you get it. Still confused? It might help if you 


think of (12) as some kind of machine eo E/4D% J dwts/s.7(wyt operating on 


Z(J,4=0) =e GDS Sf AxdtyI@DE-yYI 0) 


When expanded out, Z(J, 4 =0) is a bunch of J’s thrown here and there in 
spacetime, with pairs of J’s connected by D’s. Think of a bunch of strings with the 
string ends corresponding to the J’s. What does the “machine” do? The machine 
is a sum of terms, for example, the term 


2 f 4 4 sy 5 yf 
~A | mi fa m2 Eon ae | 
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Figure 1.7.8 


When this term operates on a term in Z(/, A = 0) it grabs four string ends and 
glues them together at the point w.; then it grabs another 4 string ends and glues 
them together at the point w,. The locations w, and w, are then integrated over. 
Two examples are shown in Figure 1.7.8. It is a game you can play with a child! 
This childish game of gluing four string ends together generates all the Feynman 
diagrams of our scalar field theory. 


Do it once and for all 


Now Feynman comes along and says that it is ridiculous to go through this long- 
winded yakkety-yak every time. Just figure out the rules once and for all. 

For example, for the diagram in Figure I.7.7a associate the factor —iA with 
the scattering, the factor D(x, — w) with the propagation from x, to w, and so 
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forth—conceptually exactly the same as in our baby problem. See, you could have 
invented Feynman diagrams. (Well, not quite. Maybe not, maybe yes.) 

Just as in going from (1.4.1) to (1.4.2), it is easier to pass to momentum space. 
Indeed, that is how experiments are actually done. A meson with momentum k, 
and a meson with momentum k, collide and scatter, emerging with momenta k3 
and k, (see Figure I.7.7b). Each spacetime propagator gives us 


a‘ k. ef ko(Xqg—wW) 
— a 
~ J (Qn) k2 — m2 +ie 


D(x, — w) 


Note that we have the freedom of associating with the dummy integration variable 
either a plus or a minus sign in the exponential. Thus in integrating over w in (17) 
we obtain 


J d* we ihitke—ks—kaw — (Qa7y49 (ky + ky — kg — keg). 


That the interaction could occur anywhere in spacetime translates into momentum 
conservation k, + ky = k3 + k4. (We put in the appropriate minus signs in two of 
the D’s so that we can think of k3 and k4 as outgoing momenta.) 

So there are Feynman diagrams in real spacetime and in momentum space. 
Spacetime Feynman diagrams are literally pictures of what happened. (A trivial 
remark: the orientation of Feynman diagrams is a matter of idiosyncratic choice. 
Some people draw them with time running vertically, others with time running 
horizontally.) 


The rules 


We have thus derived the celebrated momentum space Feynman rules for our scalar 
field theory: 


1. Draw a Feynman diagram of the process (Fig. I.7.7b for the example we just 
discussed). 

2. Label each line with a momentum k and associate with it the propagator 
i/(k? ~ m? + ie). 

3. Associate with each interaction vertex the coupling —iA and (27)4 
8S"; ki — 2) kj), forcing the sum of momenta 5°; k; coming into the 
vertex to be equal to the sum of momenta }° j *j going out of the vertex. 

4. Momenta associated with internal lines are to be integrated over with the 


measure £4. Incidentally, this corresponds to the summation over inter- 
mediate states in ordinary perturbation theory. 

5. Finally, there is a rule about some really pesky symmetry factors. They are 
the analogs of those numerical factors in our baby problem. As a result, 
some diagrams are to be multiplied by a symmetry factor such as }. These 


originate from various combinatorial factors counting the different ways in 
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which the (/6/)’s can hit the J’s in expressions such as (18). I will let you 
discover a symmetry factor in Exercise I.7.2. 


We will illustrate by examples what these rules (and the concept of internal 
lines) mean. 

Our first example is just the diagram (Fig. 1.7.7b) that we have calculated. 
Applying the rules we obtain 


4 . 
~iA (2748 k, + k> —_— k —_— k4) I] (a-saa) 


—m2ti 
gel 4 m“+i1é 


You would agree that it is silly to drag the factor ITt_, (ate) around, since 


it would be common to all diagrams in which two mesons scatter into two mesons. 
So we append to the Feynman rules an additional rule that we do not associate 
a propagator with external lines. (This is known in the trade as “amputating 
the external legs”.) And since there is always an overall factor for momentum 
conservation we should not drag the overall delta function around either. Thus, we 
have two more rules: 


6. Do not associate a propagator with external lines. 
7. A delta function for overall momentum conservation is understood. 


The amplitude for the diagram in Figure I.7.7b is just —iA. 


The birth of particles 


As explained in Chapter I.1 one of the motivations for constructing quantum field 
theory was to describe the production of particles. We are now ready to describe 
how two colliding mesons can produce four mesons. The Feynman diagram in 
Figure 1.7.9 (compare Fig. I.7.2a) can occur in order 7 in perturbation theory. 
Amputating the external legs, we drop the factor To _,fi / k? — m* + ie]associated 
with the six external lines, keeping only the propagator associated with the one 
internal line. For each vertex we put in a factor of (—iA) and a momentum 
conservation delta function (rule 3). Then we integrate over the internal momentum 
q (tule 4) to obtain 


dq i 

~iay? ——_____(2n)*8%(k, +h — ka 

(—id) (ony gz om? + ie. my BN (ky + kp — ky — 4) 
(2)48(g — (ky + ks + ke)] (20) 


The integral over q is a cinch, giving 
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(—id)? 


(ka + ks + kg)? —m?2+ie 
(21 )45O [ky + ky — (kg hg t+ ks t+ ko (21) 


We have already agreed (Rule 7) not to drag the overall delta function around. 
This example teaches us that we didn’t have to write down the delta functions 
and then annihilate (all but one of ) them by integrating. In Figure 1.7.5 we could 
have simply imposed momentum conservation from the beginning and labeled the 
internal line as k, + ks + kg instead of q. 

With some practice you could just write down the amplitude 

+4 \2 I 
( tA) (ky + ks + ke)? _ m2 + lé (22) 


directly: just remember, a coupling (—iA) for each vertex and a propagator for 
each internal (but not external) line. Pretty easy once you get the hang of it. 


The cost of not being real 


The physics involved is also quite clear: The internal line is associated with a virtual 
particle whose relativistic 4-momentum k, + ks + kg squared is not necessarily 
equal to m*, as it would have to be if the particle were real. The farther the 
momentum of the virtual particle is from the mass shell the smaller the amplitude. 
You are penalized for not being real. 

According to the quantum rules for dealing with identical particles, to obtain 
the full amplitude we have to symmetrize among the four final momenta. One way 
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k, +k,-k 


Figure 1.7.10 


of saying it is to note that the line labeled k3 in Figure (1.7.9) could have been 
labeled k4, ks, or kg, and we have to add all four contributions. 

To make sure that you understand the Feynman rules I insist that you go through 
the path integral calculation to obtain (21) starting with (12) and (13). 

I am repeating myself but I think it 1s worth emphasizing again that there is 
nothing particularly magical about Feynman diagrams. 


Loops and a first look at divergence 


Just as in our baby problem, we have tree diagrams and loop diagrams. So far we 
have only looked at tree diagrams. Our next example is the loop diagram in Figure 
1.7.10 (compare Fig. I.7.3a.) Applying the Feynman rules, we obtain 


i 


2-1) ee 2h 0CUC(<( 


As above, the physics embodied in (23) is clear: As k ranges over all possible 
values, the integrand is large only if one or the other or both of the virtual particles 
associated with the two internal lines are close to being real. Once again, there is 
a penalty for not being real (see Exercise I.7.4). 

For large k the integrand goes as 1/k*. The integral is infinite! It diverges as 
f a*k(1/k*). We will come back to this apparent disaster in Chapter III.1. 

With some practice, you will be able to write down the amplitude by inspection. 
As another example, consider the three-loop diagram in Figure 1.7.11 contributing 
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in O(A*) to meson-meson scattering. First, for each loop pick an internal line and 
label the momentum it carries, p, g, and r in our example. There is considerable 
freedom of choice in labeling—your choice may well not agree with mine, but 
of course the physics should not depend on it. The momenta carried by the other 
internal lines are then fixed by momentum conservation, as indicated in the figure. 
Write down a coupling for each vertex, and a propagator for each internal line, and 
integrate over the internal momenta p, g, and r. Thus, without worrying about 
symmetry factors, we have the amplitude 


(ia) | ap dq grt 
(2x4 (20)4* (20)4 p? — m? + ie (ky + ky — py? —m? + ie 
i i 
g*—m?+ie(p—q—r)*—m*+ie 
i i 
So 24 
r2—m? +ie (ki th, —r)* —m? +ie (24) 


Again, this triple integral also diverges: It goes as { d 12 p/P), 


An assurance 


When I teach quantum field theory, at this point in the course some students 
get unaccountably very anxious about Feynman diagrams. I would like to assure 
the reader that the whole business is really quite simple. Feynman diagrams can 
be thought of simply as pictures in spacetime of the antics of particles, coming 
together, colliding and producing other particles, and so on. One student was 
puzzled that the particles do not move in straight lines. Remember that a quantum 
particle propagates like a wave; D(x — y) gives us the amplitude for the particle 
to propagate from x to y. Evidently, it is more convenient to think of particles in 
momentum space: Fourier told us so. We will see many more examples of Feynman 
diagrams, and you will soon be well acquainted with them. Another student was 
concerned about evaluating the integrals in (23) and (24). I haven’t taught you how 
yet, but will eventually. The good news is that in contemporary research on the 
frontline few theoretical physicists actually have to calculate Feynman diagrams 
getting all the factors of 2 right. In most cases, understanding the general behavior 
of the integral is sufficient. But of course, you should always take pride in getting 
everything right. In Chapters IT.6, II.6 & If.7 I will calculate Feynman diagrams 
for you in detail, getting all the factors right so as to be able to compare with 
experiments. 


Vacuum fluctuations 


Let us go back to the terms I neglected in (18) and which you are supposed to have 
figured out. For example, the four [5/5/(w)]’s could have hit J,, Jz, J,, and J), 
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Figure I.7.11 


in (18) thus producing something like 


—in [fff PacPrytolotel (| PwwPov)- 
w 


The coefficient of J (x) J (x2) J(x3) J (xq) is then D,3Do4(—-id f, DixwP ww) plus 
terms obtained by permuting. 

The corresponding physical process is easy to describe in words and in pictures 
(see Figure I'7.12). The source at x, produces a particle that propagates freely 
without any interaction to x3, where it comes to a peaceful death. The particle 
produced at x2 leads a similarly uneventful life before being absorbed at x4. 
The two particles did not interact at all. Somewhere off at the point w, which 
could perfectly well be anywhere in the universe, there was an interaction with 
amplitude —iA. This is known as a vacuum fluctuation: As explained in Chapter 
I.1, quantum mechanics and special relativity inevitably cause particles to pop out 
of the vacuum, and they could even interact before vanishing again into the vacuum. 
Look at different time slices (one of which is indicated by the dotted line) in Figure 
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X3 X4 


Figure I.7.12 


1.7.12. In the far past, the universe has no particles. Then it has two particles, then 
four, then two again, and finally in the far future, none. We will have a lot more 
to say in Chapter VIII.2 about these fluctuations. Note that vacuum fluctuations 
occur also in our baby and child problems (see, e.g., Figs. I.7.1c, 1.7.2f,g, 1.7.3f,g, 
and so forth). 


Two words about history 


I believe strongly that any self-respecting physicist should learn about the history 
of physics, and the history of quantum field theory is among the most fascinating. 
Unfortunately, I do not have the space to go into it here.* The path integral approach 
to field theory using sources J(x) outlined here is mainly associated with Julian 
Schwinger, who referred to it as “sorcery” during my graduate student days (so that 
I could tell people who inquired that I was studying sorcery in graduate school.) In 
one of the many myths retold around tribal fires by physicists, Richard Feynman 
came upon his rules in a blinding flash of insight. In 1949 Freeman Dyson showed 
that the Feynman rules which so mystified people at the Pocono conference a year 
earlier could actually be obtained from the more forma! work of Julian Schwinger 
and of Shin-Itiro Tomonaga. 


Exercises 


1.7.1. Work out the amplitude corresponding to Figure [.7.12 in (18). 


4 An excellent sketch of the history of quantum field theory is given in Chapter | of The 
Quantum Theory of Fields by S. Weinberg. 
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Derive (23) from first principles, that is from (12). It is a bit tedious, but straight- 
forward. You should find a symmetry factor 3. 


Draw all the diagrams describing two mesons producing four mesons up to and 
including order 42. Write down the corresponding Feynman amplitudes. 


By Lorentz invariance we can always take k, + k, = (E, 6) in (23). The integral 
can be studied as a function of E. Show that for both internal particles to become 
real E must be greater than 2m. Interpret physically what is happening. 


Chapter 1.8 


Quantizing Canonically and 
Disturbing the Vacuum 


Always create before we annihilate, not the other way around. 
—Anonymous 


Complementary formalisms 


I adopted the path integral formalism as the quickest way to quantum field theory. 
But I must also discuss the canonical formalism, not only because historically it 
was the method used to develop quantum field theory, but also because of its con- 
tinuing importance. Interestingly, the canonical and the path integral formalisms 
often appear complementary, in the sense that results difficult to see in one are 
often clear in the other. 

I will be rather brief on the canonical formalism, partly because it is the standard 
approach treated exhaustively in most of the texts on quantum field theory. 


Heisenberg and Dirac 


Let us begin with a lightning review of Heisenberg’s approach to quantum me- 
chanics. Given a classical Lagrangian for a single particle L = 1g? — V(q) (we 
set the mass equal to 1), the canonical momentum is defined as p = 6L/5q = g. 
The Hamiltonian is then given by H = pg — L = p?/2 + V(q). Heisenberg pro- 
moted position q(t) and momentum p(t) to operators by imposing the canonical 
commutation relation 


Operators evolve in time according to 


dp _, __y 
at ={H, p)=—-V(q) (2) 
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and 


dq. 
Tt ={H,q]=p (3) 
In other words, operators constructed out of p and q evolve according to O(t) = 
e' 4 QO(O)e 4" In (1) p and g are understood to be taken at the same time. We 
obtain the operator equation of motion g = ~V’(qg) by combining (2) and (3). 
Following Dirac, we invite ourselves to consider at some instant in time the 
operator a = (1/./2)(wq + ip) with some parameter w. From (1) we have 


ja, a] =1 (4) 


The operator a(t) evolves according to 


da . 1 ; _ @ . 1 I 
ap Tile Jaw Od + P= i [2 (w+4v@) 


which can be written in terms of a anda . The ground state |0 > is defined as the 
State such that a|0 >= 0. 

In the special case V’(q) = w*q we get the particularly simple result a = 
—iwa. This is of course the harmonic oscillator L = 1g* — 1w*q? and H = 
w(ata + 3). 

The generalization to many particles is immediate. Starting with 


L=)> 34a — V(qi. 2, +++ 4) 
a 


we have p, = 6L/éq, and 
[Palt), a,(t)] = —i8g, (3) 


The generalization to field theory is almost as immediate. In fact, we just use 
our handy-dandy substitution table (1.3.6) and see that in D—dimensional space L 
generalizes to 


L= / a x{1 (gp)? — m2?) — u(y)} (6) 


where we denote the anharmonic term (the interaction term in quantum field 
theory) by u(y). The canonical momentum density conjugate to the field g(x, t) 
is then 


m éL + 
WE ry C09) (7) 


so that the canonical commutation relation at equal times now reads 


[7 (X, t), eX’, D1 = (dp, 1), GR, 1) = 16" — X*) (8) 
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(and of course also [7 (x, t), w(x’, t)] =0 and [g(Z, t), g(x’, t)] = 0.) Note that 
Sap in (5) gets promoted to 6“?)(¥ — x’) in (8). You should check that (8) has the 
correct dimension. Turning the canonical crank we find the Hamiltonian 


H= f asirG, tdgg(X, t) — L] 


= / a? x{4[x? + (Vy)? + m9] + uy)} (9) 


For the case u = 0, corresponding to the harmonic oscillator, we have a free 
scalar field theory and can go considerably farther. The field equation reads 


(a7 + m)p =0 (10) 


Fourier expanding, we have 


5 dP? > z= =o oo 
g@,t)= | abe #9 4 ater} a1 
¥ (27)"2a, 
with co, = +k? + m? so that the field equation (10) is satisfied. The peculiar 
looking factor (204) 72 is chosen so that the canonical relation 


[a(k), a! (k)] = 8 OK - (12) 


appropriate for creation and annihilation operators implies the canonical commu- 
tation [agp (X, t), o(%’, t)] = —18 (¥ — ¥’) in (8 ). You should check this but you 
can see why the factor (20,72 is needed since in dog a factor w, is brought down 
from the exponential. 

As in quantum mechanics, _the vacuum or ground state |0) is defined by the 
condition a(k) [O) = 0 for all k. Let us calculate (Ol o(X, to, 0) |O) for t > 0. 
Of the four terms a‘a', a‘a, aa‘, and aa in the product of the two fields only aa’ 
survives, and thus using (12) we obtain f[[d?k/(27)?2a,Je?@#"-**) | In other 
words, if we define the time-ordered product 


Tle(x)o(yy] = 0° — ype”) + 00° — %)G~~)e(~) (13) 
we find 
(0| Tig, t)g@, 0)]|0) = 


dPk ~i(wt-7-z) +i (ot—K-Z) 
Dabo Oe HR) O(_t)e ] (14) 
k 


Go back to (1.3.23). We discover that (0| T[g(x)g(0)]|0) =i D(x), the propagator 
for a particle to go from 0 to x we obtained using the path integral formalism. This 
further justifies the ie prescription in (1.3.20). The physical meaning is that we 
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always create before we annihilate, not the other way around. This is a form of 
causality as formulated in quantum field theory. 

Note that hermiticity fixes the second term in the square bracket in (11) in terms 
of the first. If ¢ 4 g" (as we will see in Chapter I.9) then the two terms would have 
been different. This important case is developed in Exercise I.8.2, an example of 
a must-do exercise whose result we will need later. 


The energy of the vacuum 


As an instructive exercise let us calculate for the free scalar field theory the 
expectation value 


(0| H|0) = ; J d?x(0| x? + (Vo)? + mp? [0) 


which we may loosely refer to as the “energy of the vacuum.” It is merely a matter 
of putting together (7), (11), and (12). Let us focus on the third term in (0| A |0), 
which in fact we have already computed since 


(0| o(%, t)g(%, t) 10) = 0} gO, 0) eG, 0) |0) 
lim (0| 9(%, t)y(, 0) |0) 


x,t>0,0 


—i (wt —k-) 


tim f oe 
x,t0,0 (27) 2a, 


The first equality follows from translation invariance, which also implies that 
the factor f d?x in (0| H |0) can be immediately replaced by V, the volume of 
space. The calculation of the other two terms proceeds in much the same way: for 
example, the two factors of V in (Vo) just bring down a factor of Re. Thus, 


(0| H 0) =V 4k |, (w? +i +m] = jv fs lia, (16) 
(27)? 2, (21)P ? 


upon restoring /t. 

You should find this result at once gratifying and alarming, gratifying because 
we recognize it as the zero point energy of the harmonic oscillator integrated over 
all momentum modes and over all space and alarming because the integral over 
k clearly diverges. But we should not be alarmed: The energy of any physical 
configuration, for example, the mass of a particle, is to be measured relative to 
the “energy of the vacuum.” We ask for the difference in the energy of the world 
with and without the particle. In other words, we could simply define the correct 
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Hamiltonian to be H — {0} H |0). We will come back to some of these issues in 
Chapters II.5, 01.1, and VIHI.2. 


Casimir effect 


But what if we disturb the vacuum? 

Of course, it is not just the scalar field that contributes to the energy of the 
vacuum. The electromagnetic field, for instance, also undergoes quantum fluctu- 
ations and would contribute to the energy density ¢ of the vacuum something like 
2 f[d?k/(27)*}4he,. In 1948 Casimir had the brilliant insight that we could dis- 
turb the vacuum and produce a shift Ae. While ¢ is not observable, Ae should be 
observable since we can control how we disturb the vacuum. In particular, Casimir 
considered introducing two parallel “perfectly” conducting plates (formally of zero 
thickness and infinite extent) into the vacuum. The variation of Ae with the dis- 
tance d between the plates would lead to a force between them, known as the 
Casimir force. 

Call the direction perpendicular to the plates the x-axis. Because of the bound- 
ary conditions that the electromagnetic field must satisfy on the plates, the wave 
vector can only take on the values (7n/d, k,, k,), with n an integer. Thus, the 
energy per unit area between the plates is changed to 


dk,dk, [an 
>| omry gr tite 


To calculate the force, we vary d, but we also have to worry about how the 
energy density outside the two plates varies. There is a clever trick for avoiding 
this worry: we introduce a third plate! See Figure 1.8.1. We hold the two outer 
plates fixed and move only the inner plate. Now we don’t have to worry about the 
world beyond the plates. The separation L between the two outer plates can be 
taken to be as large as we like. 

In the spirit of this book (and my philosophy) of avoiding computational tedium 
as much as possible, I propose two simplifications: (1) do the calculation for a 
massless scalar field instead of the electromagnetic field so that we won’t have to 
worry about polarization and stuff like that, and (2) retreat to a (1 + 1)-dimensional 
spacetime so that we won’t have to integrate over k, and k,. As you will see, 
the calculation is exceedingly instructive and gives us an early taste of the art of 
extracting finite physical results from seemingly infinite expressions, known as 
regularization, that we will study in Chapters III. 1-3. 

With this setup, the energy is given by E = f(d) + f(L — d) with 


oO 
1 
d) = —— 17 
fa=> dX n (17) 
since the modes are given by sin(nmx/d) (n =1,..., 00), with the corresponding 


energy w, =n71/d. 
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<—$$______ | —_—___> 


Figure 1.8.1 


Aagh! What do we do with )>~° , n? None of the ancient Greeks from Zeno on 
could tell us. 

What they should tell us is that we are doing physics, not mathematics! Physical 
plates cannot keep arbitrarily high frequency waves from leaking out. To incorpo- 
rate this piece of all-important physics we should introduce a factor e~7®" so that 
modes with w, >> a~! do not contribute: They don’t see the plates! Thus, we have 


tT 1a 1a 1] 
d) = — ~ann/d = mannfd_ 
fa) od 2" 23a 2! 2 da 1—e-47/4 
an/d 
It é (18) 


~ 2d (eat/d — Wye" 
Since we want a! to be large, we take the limit a small so that 


d ais a) 
d) = ——= —>— + O(a’). 19 
fd) = 5 — 545 + 0@") (19) 
Note that f(d) blows up as a — 0, as it should since we are then back to (17). 
But the force between two conducting plates shouldn’t blow up. Experimentalists 
might have noticed that by now! 
Well, the force is given by the negative of 


OE _ yf (W—-ad=(2. 4+ 
a 1 O-FE a=(S5+s 


nx 1 1 nthe 


—_ ——_) = 
a>024 d* (L—d)? L>d 24d? 


+ )-@>L-d) 


(20) 
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The 1/a* divergence cancels. 

Behold, our beloved physics makes sense! We have restored fi to underscore 
the quantum nature of the force. Since dE /dd > 0, the Casimir force between 
two plates is attractive. In an experimental tour de force, this tiny force has been 
measured. 


Nobody is perfect 


In the canonical formalism, time is treated differently from space, and so one 
might worry about the Lorentz invariance of the resulting field theory. In the 
standard treatment given in many texts, we would go on from this point and use 
the Hamiltonian to generate the dynamics, developing a perturbation theory in the 
interaction u(y). After a number of formal steps, we would manage to derive the 
Feynman rules, which manifestly define a Lorentz invariant theory. 

Historically, there was atime when people felt that quantum field theory should 
be defined by its collection of Feynman rules, which gives us a concrete procedure 
to calculate measurable quantities such as scattering cross sections. An extreme 
view along this line held that fields are merely mathematical crutches used to 
help us arrive at the Feynman rules and should be thrown away at the end of the 
day. 

This view became untenable starting in the 1970s when it was realized that there 
is a lot more to quantum field theory than Feynman diagrams. Field theory contains 
nonperturbative effects invisible by definition to Feynman diagrams. Many of these 
effects, which we will get to in due time, are more easily seen using the path integral 
formalism. 

As I noted, the canonical and the path integral formalisms often appear to be 
complementary, and I will refrain from entering into a discussion about which 
formalism is superior. In this book, I adopt a pragmatic attitude and use whichever 
formalism happens to be easier for the problem at hand. 

Let me mention however some particularly troublesome features in each of 
the two formalisms. In the canonical formalism fields are quantum operators 
containing an infinite number of degrees of freedom, and sages once debated such 
delicate questions as how products of fields are to be defined. On the other hand, 
in the path integral formalism, plenty of sins can be swept under the rug known as 
the integration measure (see Chapter IV.7.) 


Appendix 


It may seem a bit puzzling that in the canonical formalism the propagator has to be defined 
with time ordering, which we did not need in the path integral formalism. To resolve this 
apparent puzzle, it suffices to look at quantum mechanics. 

Let A[q(t)] be a function of qg, evaluated at time r,. What does the path integral 


Lope . 
f Da@) Algae’ fo #b4,9) represent in the operator language? Well, working backward 
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to (1.2.3) we slip A[g(¢,)] into the appropriate factor of (g;4,|e7'#*" |q;), where appro- 
priate means that the time ¢, occurs between the times j5¢ and (j + 1)éf for that j. In 
(qjaile 4" Alg (4) ]1¢;) we replace the c-number A[g(1,)] with the operator A[4(¢,)]. The 
integral represents (q |e!" —V A[q(1,)]e "4" |[q7). 

We are now prepared to ask the more complicated question: What does the path integral 


| Da(t) Alg@)1Blg (le! iM dtL(q4,q) 


represent in the operator language? Here B[q (12)]is some other function of g evaluated at 
time 1,. So we also slip B[g (t,)] into the appropriate factor in (1.2.3) and replace B[q (f)] 
by B[g(t2)]. But now we see that we have to keep track of whether f; or 7, is the earlier of 
the two times. If f is earlier, the operator A[q(7,)] would appear to the left of the operator 
B[g(t)], and if t, is earlier, vice versa. Explicitly, if t, is earlier than t,,we would have the 
sequence 


eT AT Algae 4? BIG(n le 8? 
Thus, we define the time-ordered product 
T{A[9t)) ]B[g (2) I} = Ot, — 12) ALG CD IBIG ()] + OG — BIG ALG] 21) 
We learned that 


. ft . 
(ari T{Alg (ty) 1BIG@) Ne"? Ig;) = | Dq(t) Alg(t) IBlg(ayle' fo #449 (22) 


The concept of time ordering does not appear on the right-hand side, but is essential on 
the left hand side. 


Exercises 


1.8.1. Some authors prefer to remove the square root sign in (11) to define a(k) and 
a'(k) accordingly. Show that for an arbitrary function f 


+ 3 => 
/ d4k5 (2 — m)0(k) f (, B) = | FE Fwy, b. 
2H, 


Argue that d°k/2a,; is thus a Lorentz invariant measure. (Hint: Lorentz trans- 
formations cannot change the sign of k°.] Thus, the creation and annihilation 
operators defined by these authors are Lorentz covariant. Work out their commu- 
tation relation. 


1.8.2. Calculate (| H |k), where |k) = a‘ (K)|0). 


1.8.3. We have discussed a hermitean (often called real in a minor abuse of terminology) 
scalar field. Consider instead a complex scalar field governed by £ = ay’ ae - 
mo 9. Write down the Euler-Lagrange equations of motion treating ¢ and gy 


1.8.4. 


1.8.6. 
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as independent variables. The nonhermiticity of ¢ means that we have to replace 
(11) by 


[a (ke (ot —K-X) + bi (be! (at-kD)) (23) 


(x o=| ak 
on J Qn) 2a, 


Show that the canonical commutation relations imply that (a, a’) and (6, b') 
form two independent sets of creation and annihilation operators. Calculate 
(0| T [(x)g' (0)]{0). [Hint: dog is conjugate to g*, not ¢.] 


Using the equations of motion, verify that the current J, =i (9 3,9 _ 19'9) 
is conserved and that the corresponding charge is given by Q = [{ dP xJy(x) = 
f dk{a‘(k)a(k) — b*(E)b()]. We see that the particle created by a‘ (call it the 
“particle”) and the particle by b’ (call it the “antiparticle”) carry opposite charge. 
We see that g ‘creates a particle and annihilates an antiparticle, that is, it produces 
one unit of charge. The field g does the opposite. Show that [@, ¢(x)] = —¢(x). 


. For a field A show that for g° > 0 


Im{i | d‘xe!4* (0| TLA(x) A(0)] |0)} = ; | d4xe!4* (0| [A(x), A(O)](0) (24) 


by inserting 1= )~ |n)(n| (with |n) a complete set of states) between A(x) 


and A(0) on the left-hand side. [Hint: Use the integral representations 0(t) = 
—i f (dw /2m)e' /(@ — ie) and 6(—t) =! f (dw/2n)e' /(w +ie).] 


Show off your skill in doing integrals by calculating the Casimir force in (3 + 1)- 
dimensional spacetime. For help, see M. Kardar and R. Golestanian, Rev. Mod. 
Phys. 71-1233, (1999); J. Feinberg, A. Mann, and M. Revzen, Ann. Phys. 288:03, 
(2001). 


Chapter 1.9 
Symmetry 


Symmetry, transformation, and invariance 


The importance of symmetry in modern physics cannot be overstated.! 

When a law of physics does not change upon some transformation, that law is 
said to exhibit a symmetry. 

I have already used Lorentz invariance to restrict the form of an action. Lorentz 
invariance is of course a symmetry of spacetime, but fields can also transform in 
what is thought of as an internal space. Indeed, we have already seen a simple 
example of this. I noted in passing in Chapter I.3 that we could require the action 
of a scalar field theory to be invariant under the transformation g — ~g and so 
exclude terms of odd power in g, such as g°, from the action. 

With the g? term included, two mesons could scatter and go into three mesons, 
for example by the diagrams in (Fig. I.9.1). But with this term excluded, you 
can easily convince yourself that this process is no longer allowed. You will 
not be able to draw a Feynman diagram with an odd number of external lines. 
(Think about modifying the integral in our baby problem in Chapter I.7 to 
~~ dqe-2™ 9 - aq°-q"+Jq .) Thus the simple reflection symmetry g — —¢ im- 
plies that in any scattering process the number of mesons is conserved modulo 2. 

Now that we understand one scalar field, let us consider a theory with two scalar 
fields y, and @ satisfying the reflection symmetry g, > —g, (a = lor 2): 


L= 5801) — sigh - “194 + (0x)? - smig3 - “28 - 5010, (1) 
We have two scalar particles, call them 1 and 2, with mass m, and m>. To lowest 
order, they scatter in the processes 1+ 1— 1+ 1,2+2—42+42,14+2->1+ 
2,1+1+2+42,and2+2— 1+ 1 (convince yourself). With the five parameters 
M1, My, Ay, Ao, and p completely arbitrary, there is no relationship between the 
two particles. 

It is almost an article of faith among theoretical physicists, enunciated forcefully 
by Einstein among others, that the fundamental laws should be orderly and simple, 


‘A. Zee, Fearful Symmetry. 
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Figure 1.9.1 


rather than arbitrary and complicated. This orderliness is reflected in the symmetry 
of the action. 

Suppose that m, =m and A, = /.; then the two particles would have the same 
mass and their interaction, with themselves and with each other, would be the same. 
The Lagrangian £ becomes invariant under the interchange symmetry ¢; <—> #2. 

Next, suppose we further impose the condition p =A; = Az so that the La- 
grangian becomes 


2 

£= 2 [ apy? + @os"| - 5m? (v2 +03)-F (v8 +82) @ 
and is now invariant under the 2-dimensional rotation {g,(x) — cos 8 g(x) + 
sin 0 @(x), @2(x) > — sin 8 g(x) + cos @ g2(x)} with @ an arbitrary angle. We 
say that the theory enjoys an “internal” 5O(2) symmetry, internal in the sense that 
the transformation has nothing to do with spacetime. In contrast to the interchange 
symmetry ~; <—> ¢» the transformation depends on the continuous parameter @, 
and the corresponding symmetry is said to be continuous. 

We see from this simple example that symmetries exist in hierarchies. 


Continuous symmetries 


If we stare at the equations of motion (9? + m7), + 4¢7y, = 0 long enough 
we see that if we define J* =i(p,0"g) — 9,0"), then 0, J" = i(y,87¢2 — 
797) = 0: so that J“ is a conserved current. The corresponding charge Q = 
f dx J°, just like electric charge, is conserved. 

Historically, when Heisenberg noticed that the mass of the newly discovered 
neutron was almost the same as the mass of a proton, he proposed that if electro- 
magnetism was somehow turned off there would be an internal symmetry trans- 
forming a proton into a neutron. 

An internal symmetry restricts the form of the theory, just as Lorentz invariance 
restricts the form of the theory. Generalizing our simple example, suppose we 
want to construct a field theory containing N scalar fields g,, witha =1,---,N 
such that the theory is invariant under the transformations g, > R,pp, (repeated 
indices summed), where the matrix R is an element of the rotation group SO(N) 
(see Appendix B for a review of group theory). The fields @, transform as a vector 
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“M8254 + 84cObd + 50 ) 


Figure 1.9.2 


= (¢), ---,@y). Wecan form only one basic invariant, namely the scalar product 
PP =Gq%, =G" (as always, repeated indices are summed unless otherwise 
specified), The Lagrangian is thus restricted to have the form 


_ lf aay2 2222] A a22 
£ => [00 m5? 7@) (3) 


The Feynman rules are given in Fig. 1.9.2. When we draw Feynman diagrams, each 
line carries an internal index in addition to momentum. 

Symmetry manifests itself in physical amplitudes. For example, imagine calcu- 
lating the propagator iD,,(x) = f Dge'*g,(x)g,(0). We assume that the measure 
D@ is invariant under SO(N). By thinking about how D,,(x) transforms under 
the symmetry group SO(N) you see easily (Exercise 1.9.2) that it must be pro- 
portional to 6,,. You can check this by drawing a few Feynman diagrams or by 
considering an ordinary integral { dge~>\q_q,. No matter how complicated a 
diagram you draw (Fig. 1.9.3, e.g.) you always get this factor of 5,,. Similarly, 
scattering amplitudes must also exhibit the symmetry. 

Without the SO(N) symmetry, many other terms would be possible (e.g., 
PaPrPcPq for some arbitrary choice of a, b, c, and d) in (3). 

We can write R = e9'? where 0 -T = )~ , 047“ is areal antisymmetric matrix. 
The group SO(N) has N(N — 1)/2 generators, which we denote by 74. [Think 
of the familiar case of §O(3).] Under an infinitesimal transformation (repeated 
indices summed) yg, > RapGp ~ (1+ 04T 4), 4@p, oF in other words, we have the 
infinitesimal change 5g, = 04T“4 py). 


Figure 1.9.3 
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Noether’s theorem 


We now come to one of the most profound observations in theoretical physics, 
namely Noether’s theorem, which states that a conserved current is associated with 
each generator of a continuous symmetry. The appearance of a conserved current 
for (2) is not an accident. 

As ts often the case with the most important theorems, the proof of Noether’s 
theorem is astonishingly simple. Denote the fields in our theory generically by g, 
Since the symmetry is continuous, we can consider an infinitesimal change d¢,,. 
Since £ does not change, we have 


bh bh 
0=sL= Yq + 50.Pa 
dfg 53,,q 
bL bh 
= 69, + ——3,,6 4 
bPg Ya 58 Ga * Ya ® 


We would have been stuck at this point, but if we use the equations of motion 
bL/59,4 = 4,,(8£/50,,¢,) we can combine the two terms and obtain 


64 
O=d, | ——é 5 
i ( 53,04 +] (5) 
If we define 
ea 8 gy (6) 
59Ga 


then (5) says that 0,,J =0. We have found a conserved current! [It is clear from 
the derivation that the repeated index a is summed in (6)]. 

Let us immediately illustrate with the simple scalar field theory m (3). Plugging 
5G = 94(T4),,¢} into (6) and noting that 94 is arbitrary, we obtain N(N — 1)/2 
conserved currents Ja = Bn PatT “arb one for each generator of the symmetry 
group SO(N). 

In the special case N = 2, we can define a complex field g = (¢, + ig2)/ /2. 
The Lagrangian in (3) can be written as 


L = 89 dy — my'9 — Ay 9)’, 


and is clearly invariant under y — e'°g and yg > e 8g" We find from (6) that 
J, =i (y',9 — 3,9 9), the current we used in Exercise 1.8.4. Mathematically, 
this is because the groups SO(2) and U(]) are isomorphic (see Appendix B). 
For pedagogical clarity I have used the example of scalar fields transforming as 
a vector under the group $ O(N). Obviously, the preceding discussion holds for an 
arbitrary group G with the fields » transforming under an arbitrary representation 
R of G. The conserved currents are still given by J A = 8,PalT)a¥r with T4 
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the generators evaluated in the representation R. For example, if g transform as 
the 5-dimensional representation of 50 (3) then T4 is a 5 by 5 matrix. 

For physics to be invariant under a group of transformations it is only necessary 
for the action to be invariant. The Lagrangian density £ could very well change by 
a total divergence: 5£ = d,,K", provided that the relevant boundary term could be 
dropped, Then we would see immediately from (5) that all we have to do to obtain 
a formula for the conserved current is to modify (6) to J” = 54/50,,¢,5@_ — K*. 
As we will see in Chapter VIII.4, many supersymmetric field theories are of this 


type. 


Charge as generators 


Using the canonical formalism of Chapter I.8, we can derive an elegant result for 
the charge associated with the conserved current 


Qa f exs={ as st S@q 
b90G, 


Recognizing that 5£ /dd9q@,, is just the canonical momentum conjugate to the field 
aq, we see that 


LQ, Pa) = bq (7) 


The charge operator generates the corresponding transformation on the fields. An 
important special case is for the complex field g in SO(2) ~ U(1) theory we 
discussed; then [Q, g] = ¢ and e!P Qoei9@ = eo, 


Exercises 


1.9.1. Some authors prefer the following more elaborate formulation of Noether’s theo- 
rem. Suppose that the action does not change under an infinitesimal transfor- 
mation 6¢,(x) = gaya [with 04 some parameters labeled by A and vA some 
function of the fields y, (x) and possibly also of their first derivatives with respect 
to x]. It is important to emphasize that when we say the action S does not change 
we are not allowed to use the equations of motion. After all, the Euler-Lagrange 
equations of motion follow from demanding that 6S = 0 for any variation d¢, 
subject to certain boundary conditions. Our scalar field theory example nicely il- 
lustrates this point, which is confused in some books: 85 = 0 merely because S 
is constructed using the scalar product of O(N) vectors. 

Now let us do something apparently a bit strange. Let us consider the infinites- 
imal change written above but with the parameters 04 dependent on x. In other 
words, we now consider 69, (x) = 64(x) VA. Then of course there is no reason 
for 5S to vanish but, on the other hand, we know that since 65 does vanish when 
04 is constant, 5S must have the form 65 = f d*x J#(x) 0,04 (x). In practice, this 


1.9.2. 


1.9.3. 


1.9.4. 
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gives us a quick way of reading off the current J“ (x); it is just the coefficient of 
4,,04(x) in 5S. 
Show how all this works for the Lagrangian in (3). 


Show that D,,(x) must be proportional to 6,, as stated in the text. 


Write the Lagrangian for an SO (3) invariant theory containing a Lorentz scalar 
field g transforming in the 5-dimensional representation up to quartic terms. 
[Hint: It is convenient to write g as a 3 by 3 symmetric traceless matrix.] 


Add a Lorentz scalar field 1 transforming as a vector under SO (3) to the La- 
grangian in Exercise 1.9.3, maintaining SO (3) invariance. Determine the Noether 
currents in this theory. Using the equations of motion, check that the currents are 
conserved. 


Chapter 1.10 
Field Theory in Curved Spacetime 


General coordinate transformation 


In Einstein’s theory of gravity, the invariant Minkowskian spacetime interval ds? = 
nyydx"dx” = (dt)? — (dx)? is replaced by ds? = g,,,dx"dx”, where the metric 
tensor g,,,(x) is a function of the spacetime coordinates x. The guiding principle, 
known as the principle of general covariance, states that physics, as embodied in the 
action $, must be invariant under arbitrary coordinate transformations x > x’(x). 
More precisely, the principle! states that with suitable restrictions the effect of a 
gravitational field is equivalent to that of a coordinate transformation. 
Since 


th 1a 
, Ox" Ox 
AT 9x ax 


ds*=g) dx" dx" =g dx"dx” =g,,,dx"dx” 


the metric transforms as 


ax’ ax’? 


axH axY 


Big (X) = 8uv(*) () 


The inverse of the metric g” is defined by g#"g,,, = én. 
A scalar field by its very name does not transform: g(x) = g’(x’). The gradient 
of the scalar field transforms as 


ns t,t 
ax” ag’) _ ax" ax” yl) 


8 PO) = age  ggu on? 


By definition, a (covariant) vector field transforms as 
A,()= oA 4, &') 
so that 0,,g(x) is a vector field. Given two vector fields A,,(x) and B,(x), we 


can contract them with g"(x) to form g¥”(x)A,,(x)B,(x), which, as you can 


‘For a precise statement of the principle of general covariance, see S. Weinberg, Grav- 
itation and Cosmology, p. 92. 
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immediately check, is a scalar. In particular, g4”(x)d,,p(x)0,g() is a scalar. Thus, 
if we simply replace the Minkowski metric n“” in the Lagrangian & = 3[{(@g)* — 
mg] = 4(n“”d,,pa,g — m?g") by the Einstein metric g“”, the Lagrangian is 
invariant under coordinate transformation. 

The action is obtained by integrating the Lagrangian over spacetime. Under a 
coordinate transformation d*x’ = d+x det(ax’/ax). Taking the determinant of (1), 
we have 


th 4 Ia \ 72 
g =det g,,, = det Bho ana aav = g’ E (=) (2) 
We see that the combination d+x,/—g = d*x’,/—g’ is invariant under coordinate 
transformation. 

Thus, given a quantum field theory we can immediately write down the theory 
in curved spacetime. All we have to dois promote the Minkowski metric »“” in our 
Lagrangian to the Einstein metric g“” and include a factor ,/—g in the spacetime 
integration measure.” The action S would then be invariant under arbitrary coordi- 
nate transformations. For example, the action for a scalar field in curved spacetime 
is simply 


= J d'x J=E4(9""8,,98,9 — mp?) 3) 


(There is a slight subtlety involving the spin 4 field that we will talk about in Part 
II. We will eventually come to it in Chapter VIII.1.) 

There is no essential difficulty in quantizing the scalar field in curved space- 
time. We simply treat g,,,, as given [e.g., the Schwarzschild metric in spheri- 
cal coordinates: g99 = (1—- 2GM/r), g,,=—-(A—- 2GM/r)~', g99 =—r’, and 
£49 = —?° sin’ 6] and study the path integral { Dge’*, which is still a Gaussian 
integral and thus do-able. The propagator of the scalar field D(x, y) can be worked 
out and so on and so forth. (see Exercise I.10.1). 

At this point, aside from the fact that g,,,(x) carries Lorentz indices while 
v(x) does not, the metric g,,,, looks just like a field and is in fact a classical field. 
Write the action of the world S = S, + Sy as the sum of two terms: S, describing 
the dynamics of the gravitational field g,,,, (which we will come to in Chapter 
VIII. 1) and Sj describing the dynamics of all the other fields in the world [the 
“matter fields,” namely gy in our simple example with Sy as given in (3)]. We 
could quantize gravity by integrating over g,,, as well, thus extending the path 
integral to { Dg Dge'®. 

Easier said than done! As you have surely heard, all attempts to integrate over 
8,y(x) have been beset with difficulties, eventually driving theorists to seek solace 


2 We also have to replace ordinary derivatives 3,, by the covariant derivatives D,, of 
general relativity, but acting on a scalar field g the covariant derivative is just the ordinary 
derivative D,,.g = 9,9. 
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in string theory. I wili explain in due time why Einstein’s theory is known as 
“nonrenormalizable.” 


What the graviton listens to 


One of the most profound results to come out of Einstein theory of gravity is a 
fundamental definition of energy and momentum. What exactly are energy and 
momentum any way? Energy and momentum are what the graviton listens to. (The 
graviton is of course the particle associated with the field g,,,.) 

The stress-energy tensor J” is defined as the variation of the matter action Sj, 
with respect to the metric g,,, (holding the coordinates x" fixed): 


2 éS 
TYY(x) =-—— —*# (4) 
V—8 88, y(X) 
Energy is defined as E = P® = f d°x,/—gT™(x) and momentum as P! = 


f dx.f—gT" (x). 

Even if we are not interested in curved spacetime per se, (4) still offers us a 
simple (and fundamental) way to determine the stress energy of a field theory 
in flat spacetime. We simply vary around the Minkowski metric 7,,, by writing 
Suv = Nyy +h,, and expand Sy to first order in h. According to (4), we have? 


Sy(h) = Sy(h = 0) — J d*x [sey vT” + ow’)] (5) 


The symmetric tensor field /,,,,(x) is in fact the graviton field (see Chapters 1.5 
and VIII.1). The stress-energy tensor 7“” (x) is what the graviton field couples to, 
just as the electromagnetic current J/“(x) is what the photon field couples to. 

Consider a general Sy = f d4x./—g(A + g#"B,, + gH’ g"’Cuyig +- °°). 
Since —g=1+ 7h, + O(h”) and g#” = n#” — h#” + O(h*), we find 


Ty = 2(Biy + 2Cywvapn’? +5) — Nuv® (6) 
in flat spacetime. Note 
T = n"T,y = —(4A + 2n"B,y +0- YN? Cyvag °°”) (7) 


which we have written in a form emphasizing that C does not contribute to 
the trace of the stress-energy tensor. 
We now show the power of this definition of T,,,, by obtaining long-familiar re- 


sults about the electromagnetic field. Promoting the Lagrangian of the massive spin 


VAP 


3T use the normal convention in which indices are summed regardless of any symmetry; 
in other words, 44,,)T#Y = 4 (hg T®! + AyoT +++) = hy T +---. 
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1 field to curved spacetime we have* £ = (—igt’s? F Fy, + tm?g” A Ay) 
and thus” T= —F,,F\ + m*A,Ay—myyh. 

For the electromagnetic field we set m=0. First, & = —}F,,,F*" =—} 
(—2F5, + FZ) = 4(E? — B?). Thus, Ty = — Fo, Fy’ — 3(E? — B*) = 3(E? + 
87). That was comforting, to see a result we knew from “childhood.” Incidentally, 
this also makes clear that we can think of E? as kinetic energy and B? as potential 
energy. Next, To; = —Fo, F,* = Foj Fi; = €1;,E ;B, = (E x B);—the Poynting 
vector has just emerged. 


Since the Maxwell Lagrangian & = —1 "9%? F a yp involves only the C term 
with Curio = —4F uv Pip, we see from (7) that the stress-energy tensor of the 


electromagnetic field is traceless, an important fact we will need in chapter VIIL.1. 
We can of course check directly that T = 0 (Exercise I.10.4).® 


Appendix: A concise introduction to curved spacetime 


General relativity is often made to seem more difficult and mysterious than need be. Here 
I give a concise review Of some of its basic elements for later use. 

Denote the spacetime coordinates of a point particle by X“. To construct its action note 
that the only coordinate invariant quantity is the “length’’ of the world line traced out by 
the particle (Fig. 1.10.1), namely f ds = f ./g,,,»dX"dX”, where g,,, is evaluated at the 
position of the particle of course.) Thus, the action for a point particle must be proportional 


to 
dXt# dx” 
ds = dX'dX" = x — d 
J 5 J Buy a J Sul (g)] dt dt 4 


where ¢ is any parameter that varies monotonically along the world line. The length, being 
geometric, is manifestly reparametrization invariant, that is, independent of our choice 
of ¢ as long as it is reasonable. This is one of those “more obvious than obvious” facts 
since f ./g,,,dX"dX” is manifestly independent of ¢. If we insist, we can check the 
reparametrization invariance of { ds. 


4 Here we use the fact that the covariant curl is equal to the ordinary curl D,Ay — 
D,A, = 4, A, — 4,A, and so F,,, does not involve the metric. 

> Holding x“ fixed means that we hold 8,, and hence A,, fixed. [Since A, is related to 3, 
by gauge invariance, we are anticipating (see Chapter II.7) here, but you have surely heard 
of galige invariance in a nonrelativistic context. ] 

© We see that tracelessness is related to the fact that the electromagnetic field has no 
mass scale. Pure electromagnetism is said to be scale or dilatation invariant. For more on 
dilatation invariance see S. Coleman, Aspects of Symmetry, p. 67. 

7 We put “length” in quotes because if g,,, had a Euclidean signature then { ds would 
indeed be the length and minimizing { ds would give the shortest path (the geodesic) 
between the endpoints, but here g,,, has a Minkowskian signature. 
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x0) 


Figure 1.10.1 


Obviously the powers of df match. Explicitly, if we write ¢ =¢(n), then 
dX" /dt = (dn/df)(dX"/dn) and df = (dg /dn)dn. 
Let us define 


dX" dx” 
dt dt 


K =g,,[X€)]—_ 


for ease of writing. Setting the variation of f d C/K equal to zero, we obtain 


fas dx! ddX" | dx" dX” 
VR 8 ae de a a 


which upon integration by parts (and with 6X* = 0 at the endpoints as usual) gives the 
equation of motion 


d 1 dX! dX" dX” 
K— (—~22,, = 
VRE (zz ae Te )- *Buv ae de ®) 


To simplify (8) we exploit our freedom in choosing ¢ and set df = ds, so that K = 1. 
We have 


@XH 4, , dxodX* 4 dXH dX” 
ds? o8uk as ds *8#” as. ds 


220. 


which upon multiplication by g°* becomes 


d*XP dX" dX” 
“ds? += 3° (24,8444 — 93,84») ds. ds =0 


that is 


d? XP dX" dx” 


ds? wl X (9) 


=0 (9) 
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if we define the Riemann-Christoffel symbol by 
Phy = 78 (u8va + vB ya — 8 yv) (10) 


Given the initial position X“(sp) and velocity (dX“/ds)(so) we have four second order 
differential equations (9) determining the geodesic followed by the particle in curved 
spacetime. Note that, contrary to the impression given by some texts, unlike (8), (9) is 
not reparametrization invariant. 

To recover Newtonian gravity, three conditions must be met: (1) the particle moves 
slowly dX'/ds < dX ° /ds; (2) the gravitational field is weak, so that the metric is almost 
Minkowskian g,,, ~ nv +f, and (3) the gravitational field does not depend on time. 
Condition (1) means that d?X° /ds* + P'j)(dX°/ds)* ~ 0, while (2) and (3) imply that 
Phy < —30°*a,hoo. Thus, (9) reduces to "a2x0 /ds? = 0 (which implies that dX°/ds is 
a constant) and d?X! /ds? + 59; Ao (ad X°/ds)? =~ 0, which since X° is Proportional to s 
becomes d?X' /dt? = 49; iftog. Thus, we obtain Newton’s equation 4 ii ax ~ Vo if we 
define the gravitational potential @ by fgg = 2¢: 


Referring to the Schwarzschild metric, we see that far from a massive body, ¢ = —GM/r, 
as we expect. (Note also that this derivation depends neither on h,; nor on ho;, as long as 
they are time independent.) 

Thus, the action of a point particle is 


d Xt dX” | 
S=-m J «| Sud X"d XY = —m J BX Or Ge 4 (12) 


The m follows from dimensional analysis. 

A slick way of deriving S (which also allows us to see the minus sign) is to start with 
the nonrelativistic action of a particle in a gravitational potential ¢, namely S = f Ldt = 
fGmv? —m —mo?)dt. Note that the rest mass m comes in with a minus sign as it is part 
of the potential energy in nonrelativistic physics. Now force S into a relativistic form: For 


small v and ¢, 
1 
saam fa-svteydra—m f f= v8 + 26a 
=m J y (+ 29)(dt)? — (dx)? 


We see® that the 2 in (11) comes from the square root in the Lorentz-Fitzgerald quantity 


V1—v?. 


8 We should not conclude from this that g; ; = 5;;, The point is that to leading order in 
v/c, our particle is sensitive only to 2g9, as we have just shown. Indeed, restoring c in the 
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Now that we have S we can calculate the stress energy of a point particle using (4): 


rey = fa x-k8y — xy eee 
(x) Ta g [x — X(f)] de. dt 


Setting ¢ to s (which we call the proper time r in this context) we have 


TH (x) = = / dr8[x — X(r 1 


In particular, as we expect the 4-momentum of the particle is given by 


pr= | ax./—gT” =m J drd[x® — Xr yexe dX” me 
dt dt dt 
The action (12) given here has two defects: (1) it is difficult to deal with a path integral 
f DXei™ I ACECILINCE NTL) involving a square root, and (2) S does not make sense 
for a massless particle. 
To remedy these defects, note that classically, S is equivalent to 


1 dX" dX, 3 
2a 1 
Simp = -5 fa € de dt + ym?) (13) 


where (dX“/df)(dX,,/do) = g,,,(X)(@X"/df)(dX"/d¢). Varying with respect 
to y(¢) we obtain my * = (dX"/dé)(dX,,/dt). Eliminating y in Si_, we te- 
cover S, 

The path integral f DXe'%w has a standard quadratic form.? Quantum mechanics of 
a relativistic point particle is best formulated in terms of §;,,), not S. Furthermore, for 
m=0, Simp = —3 f dély~'(dX" /dt)(dX,,/de)] makes perfect sense. Note that varying 
with respect to y now gives the well-known fact that fora massless particle g uv(X)d X¥dX” 
= 0, 

The action §;,,, will provide the starting point for our discussion on string theory in 
Chapter VIIL5. 


Schwarzschild metric we have 


ds? =(1— 26M =e dt? -(1- 26M) ldr? — do? — r* sin? @dd* > cd?” 
cr 


2GM 


— dx? — ——dt? + O(1/c?) 


* One technical problem, which we will address in Chapter IIL.4, is that in the integral 
over X(¢) apparently different functions X(¢) may in fact be the same physically, related 
by a reparametrization. 


1.10.1. 


1.10.2. 


1.10.3. 


1.10.4. 
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Exercises 


Integrate by parts to obtain for the scalar field action 


S=- / d‘*x./—g soe =.= a8", +m) 


and write the equation of motion for g in curved spacetime. Discuss the propagator 
of the scalar field D(x, y) (which is of course no longer translation invariant, i.e., 
it is no longer a function of x — y). 


Use (4) to find £ for a scalar field theory in flat spacetime. Show that the result 
agrees with what you would obtain using the canonical formalism of Chapter 1.8. 


Show that in flat spacetime P“ as derived here from the stress energy tensor T+” 
when interpreted as an operator in the canonical formalism satisfies [P", g(x)]= 
—id" g(x), and thus does exactly what you expect the energy and momentum 
operators to do, namely to be conjugate to time and space and hence represented 
by —ia". 


Show that for the Maxwell field 7;; = —(E;E; + B; Bj) + 46,;(E? + B?) and 
hence T = 0. 


Chapter I.11 
Field Theory Redux 


What have you learned so far? 
Now that we have reached the end of Part I, let us take stock of what you have 


learned. Quantum field theory is not that difficult; it just consists of doing one 
great big integral 


if d+ 71 (ag)2— 1 2922.64 
Z(J) -| Do e fa + x[3 (0g) Zin Ag +i] (1) 
By repeatedly functionally differentiating Z(J) and then setting J = 0 we obtain 
i f qPt+lyrl 2_ dd y22_ axen4 
J Do 9(x)e(%2) --- ee’ f aP+1x13 @g)?—4m2g?-194)] (2) 


which tells us about the amplitude for n particles associated with the field g to come 
into and go out of existence at the spacetime points x1, x», - + -, x,, interacting with 
each other in between. Birth and death, with some kind of life in between. 

Ah, if we could only do the integral in (1)! But we can’t. So one way of going 
about it is to evaluate the integral as a series in A : 


=. (—iayé +1 2 22 
YS fe ocsroen)--oeit faye oytte JO'Go ante 


(3) 


To keep track of the terms in the series we draw little diagrams. 

Quantum field theorists try to dream up ways to evaluate (1), and failing that, 
they invent tricks and methods for extracting the physics they are interested in, by 
hook and by crook, without actually evaluating (1 ). 

To see that quantum field theory is a straightforward generalization of quantum 
mechanics, look at how (1) reduces appropriately. We have written the theory 
in (D + 1)-dimensional spacetime, that is, D spatial dimensions and 1 temporal 
dimension. Consider (1) in (0 + 1)—dimensional spacetime, that is, no space; it 
becomes 


z= | De oi f ath By —4m2g? Ag+ Jo) (4) 
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where we now denote the spacetime coordinate x just by time t. We recognize 
this as the quantum mechanics of an anharmonic oscillator with the position of the 
mass point tied to the spring denoted by gy and with an external force J pushing 
on the oscillator. 

In the quantum field theory (1), each term in the action makes physical sense: 
The first two terms generalize the harmonic oscillator to include spatial variations, 
the third term the anharmonicity, and the last term an external probe. You can think 
of a quantum field theory as an infinite collection of anharmonic oscillators, one 
at each point in space. 

We have here a scalar field g. In previous and future chapters, the notion of field 
was and will be generalized ever so slightly: The field can transform according to 
a nontrivial representation of the Lorentz group. We have already encountered 
fields transforming as a vector and a tensor and will presently encounter a field 
transforming as a spinor. Lorentz invariance and whatever other symmetries we 
have constrain the form of the action. The integral will look more complicated but 
the approach is exactly as outlined here. 

That’s just about all there is to quantum field theory. 


PART II 
DIRAC AND THE SPINOR 


Chapter II.1 
The Dirac Equation 


Staring into a fire 


According to a physics legend, apparently even true, Dirac was staring into a fire 
one evening in 1928 when he realized that he wanted, for reasons that are no longer 
relevant, a relativistic wave equation linear in spacetime derivatives 0,, = 0/dx". 
At that time, the Klein-Gordon equation (a7 + mp = 0, which describes a free 
particle of mass m and quadratic in spacetime derivatives, was already well known. 
This is in fact the equation of motion of the scalar field theory we studied earlier. 

At first sight, what Dirac wanted does not make sense. The equation is supposed 
to have the form “some linear combination of 3,, acting on some field ¥ is equal to 
some constant times the field.” Denote the linear combination by c“d,,. If the c#’s 
are four ordinary numbers, then the four-vector c“ defines some direction and the 
equation cannot be Lorentz invariant. 

Nevertheless, let us follow Dirac and write, using modern notation, 


(Giy"d, —m)y =0 (1) 


At this point, the four quantities iy“ are just the coefficients of 4,, and m is just 
a constant. We have already argued that y” cannot simply be four numbers. Well, 
let us see what these objects have to be in order for this equation to contain the 
correct physics. 

Acting on (1) with Gy“, +m), Dirac obtained —(y“y" 4,8, +m) =0. It 
is traditional to define, in addition to the commutator [A, B]= AB — BA familiar 
from quantum mechanics, the anticommutator {A, B} = AB + BA. Since deriva- 
tives commute, y“y"d,0, = 3{y", y"}0,0,, and we have (}{y", y"}d,,a, + 
m?)y = 0. In a moment of inspiration Dirac realized that if 


fy", y"} = 2” (2) 


with n“” the Minkowski metric he would obtain (87 + m”)w = 0, which describes 
a particle of mass m, and thus (1) would also describe a particle of mass m. 
Since n“” is a diagonal matrix with diagonal elements n° = 1 and n// = —1, 
(2) says that (y°)? = 1, (y/)? =—1, and p4#y” = —y’y" for uw # v. This last 
statement, that the coefficients y“ anticommute with each other, implies that they 
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indeed cannot be ordinary numbers. Dirac’s thought would make sense if we could 
find four such objects. 


Clifford algebra 


A set of objects y” (clearly d of them in d-dimensional spacetime) satisfying the 
relation (2) is said to form a Clifford algebra. I will develop the mathematics of 
Clifford algebra later. Suffice it for you to check here that the following 4 by 4 
matrices satisfy (2): 


y=() °,)=1e5 3) 


v= (20 0 ) =o! ein (4) 


Here o and t denote the standard Pauli matrices. For historical reasons the four 
matrices y“ are known as gamma matrices——not a very imaginative name! (Our 
convention is such that whether an index on a Pauli matrix is upper or lower has no 
significance. On the other hand, we define y,, = n,,,y" and it does matter whether 
the index on a gamma matrix is upper or lower; it is to be treated just like the index 
on any Lorentz vector. This convention is useful because then y“d, = y,,0".) 

The direct product notation is convenient for computation: For example, y!y/ = 
(0! Qitm)(o/ @ it) = (o'a! @ i?t)%) = —(c'c/ @ 1) and thus {y', y/}= 
—{o', o/} @ I = —28!/ as desired. 

You can convince yourself that the y”’s cannot be smaller than 4 by 4 matrices. 
The mathematics forces the Dirac spinor y& to have 4 components! The physical 
content of the Dirac equation (1) is most transparent if we transform to momentum 
space: we plug w(x) = [[d*p/(2x)*]e'?* &(p) into (1) and obtain 


(y"p, — mv (p) =0 (3) 


Since (5) is Lorentz invariant, as we will show below, we can examine its 
physical content in any frame, in particular the rest frame p’ = (m, 0), in which 
it becomes 


(y°- ly =0 (6) 


As (y° — 1)* = —2(y° — 1) we recognize (y° — 1) as a projection operator up to 
a trivial normalization. Indeed, using the explicit form in (3), we see that there is 
nothing mysterious to Dirac’s equation: When written out (6) reads 


0 0 
(0 i)¥=" 


thus telling us that 2 of the 4 components in % are zero. 
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This makes perfect sense since we know that the electron has 2 physical degrees 
of freedom, not 4. Viewed in this light, the mysterious Dirac equation is no more 
and no less than a projection that gets rid of the unwanted degrees of freedom. 
Compare our discussion of the equation of motion of a massive spin-1 particle 
(Chapter I.5). There also, 1 of the 4 components of A” is projected out. Indeed, 
the Klein-Gordon equation (a7 + m*)p(x) =0 just projects out those Fourier 
components (k) not satisfying the mass shell condition k* = m”. Our discussion 
provides a unified view of the equations of motion in relativistic physics: They just 
project out the unphysical components. 

The convenient notation introduced by Feynman, ¢ = y“a,, for any 4-vector 
a,,, is now standard. The Dirac equation then reads @ A —m)y =0. 


Cousins of the gamma matrices 


Under a Lorentz transformation x’” = A" x, the 4 components of the vector field 
A,, transform like, well, a vector. How do the 4 components of y transform? Surely 
not in the same way as A,, since even under rotation ¥ and A,, transform quite 
differently: one as spin 4 and the other as spin 1. Let us write w(x) > w(x) = 
S(A)w (x) and try to determine the 4 by 4 matrix S(A). 

It is a good idea to first sort out (and name) the 16 linearly independent 4 by 
4 matrices. We already know five of them: the identity matrix and the y”’s. The 
strategy is simply to multiply the y” ’s together, thus generating more 4 by 4 
matrices until we get all 16. Since the square of a gamma matrix y” is equal to 
+1 and the y“’s anticommute with each other, we have to consider only y“y”, 
yly’y*, and y“y’ yy? with 2, v, A, and p all different from one another. Thus, 
the only product of four gamma matrices that we have to consider is 


peiyyly?y? (7) 


This combination is so important that it has its own name! (The peculiar name 
comes about because in some old-fashioned notation the time coordinate was 
called x* with a corresponding y*.) We have 


y=il @t)(o! Qin) (o* Bit) (co? @it,) =i @1;)(0'07o07 @ tm) 


and so 


07 
y=1e%=(5 0) (8) 


With the factor of i included, y° is manifestly hermitean. An important property 
is that y> anticommutes with the y’s: 


{y°, y“}=0 (9) 


Continuing, we see that the products of three gamma matrices, all different, can 
be written as yy? (e.g., y!ly*y? = —iy°y°). Finally, using (2) we can write the 
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product of two gamma matrices as y4y” = n¥” — io”, where 
ot = tye y"] (10) 


There are 4 -3/2 = 6 of these o%” matrices. 

Count them, we got all 16. The set of 16 matrices {1, y”, 0%”, yp“, y>} forms 
a complete basis of the space of all 4 by 4 matrices, that is, any 4 by 4 matrix can 
be wnitten as a linear combination of these 16 matrices. 

It is instructive to write out 0%” explicitly in the representation (3) and (4): 


otai(? ° (11) 
ae 
ot} = glik (“ x) (12) 


We see that o/ are just the Pauli matrices doubly stacked, for example, 


Lorentz transformation 


Recall from a course on quantum mechanics that a general rotation can be written 
as e!9/ with J the 3 generators of rotation and 6 3 rotation parameters. Recall also 
that the Lorentz group contains boosts in addition to rotations, with K denoting 
the 3 generators of boosts. Recall from a course on electromagnetism that the 
6 generators { I,K } transform under the Lorentz group as the components of 
an antisymmetric tensor just like the electromagnetic field F,,,, and thus can be 
denoted by J,,,. I will discuss these matters in more detail in Chapter II.3. For 
the moment, suffice it to note that with this notation we can write a Lorentz 
transformation as A =e 72 TeY with J‘/ generating rotations, J° generating 
boosts, and the antisymmetric tensor w,,, = —@,,, With its 6 = 4 3/2 components 
corresponding to the 3 rotation and 3 boost parameters. 

Given the preceding discussion and the fact that there are six matrices a”, 
we suspect that up to an overall numerical factor the o””’s must represent the 
6 generators J” of the Lorentz group acting on a spinor. In fact, our suspicion 
is confirmed by thinking about what a rotation e217" does, Referring to (12) 
we see that if J‘/ is represented by 10‘! this would correspond exactly to how a 
spin-} particle transforms in quantum mechanics. More precisely, separate the 4 
components of the Dirac spinor into 2 sets of 2 components: 


_{¢ 
v=(%) (13) 
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From (12) we see that under a rotation around the 3rd axis, ¢ > e 11230 @ and 
x > e101 20° x. Itis gratifying to see @ and x transform like 2-component Pauli 
spinors. 

We have thus figured out that a Lorentz transformation A acting on w is 
represented by S(A) = e7"/ 4)euvo"™ and so, acting on y, the generators J+” are 
indeed represented by 40%”. Therefore we would expect that if y (x) satisfies the 
Dirac equation (1) then ¥’(x’) = S(A)¥(x) would satisfy the Dirac equation in 
the primed frame, 


(iy), — m)y'(x') =0 (14) 


where a, = 0/x'". To show this, calculate [o“", y*] = 2i(y“n”* — y’n**) and 
hence for @ infinitesimal Sy*S~!= y* ~ G/N)o, lo", yy=y+ + yto.. 
Building up a finite Lorentz transformation by compounding infinitesimal trans- 
formations (just as in the standard discussion of the rotation group in quantum 
mechanics), we have Sy*S—! = Aiy*. 


Dirac bilinears 


The Clifford algebra tells us that (y°y? = +1 and (y')* = —1; hence the necessity 
for the i in (4). One consequence of the i is that y® is hermitean while y’ is 
antihermitean, a fact conveniently expressed as 


(y*)' = p®y#y? (15) 


Thus, contrary to what you might think, the bilinear y'y“y is not hermitean; 
rather, wy"y is hermitean with y = y'y®. The necessity for introducing w in 
addition to yw" in relativistic physics is traced back to the (+, —, —, —) signature 
of the Minkowski metric. 

It follows that (o#")' = p°a#’y9, Hence, S(A)! = p9e/Mene 19, (which 
incidentally, clearly shows that S is not unitary, a fact we knew since oo; is not 
hermitean), and so 


P(x!) = W(x) SA) Y= Pret eu (16) 
We have 
Ww (x9W'(x’) —_ plxjetG/Menve™ p—U/Menva" W(x) = o(x) W(x) 


You are probably used to writing yw‘ in nonrelativistic physics. In relativistic 
physics you have to get used to writing yy. Itis wy, not y'y, that transforms 
as a Lorentz scalar. 

There are obviously 16 Dirac bilinears yT'y that we can form, corresponding 
to the 16 linearly independent I"’s. You can now work out how various fermion 
bilinears transform (Exercise II.1.1). The notation is rather nice: Various objects 
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transform the way it looks like they should transform. We simply look at the 
Lorentz indices they carry. Thus, (x) vy" w(x) transforms as a Lorentz vector. 


Parity 


An important discrete symmetry in physics is that of parity or reflection in a mirror! 


xt —» xB = (x9, —¥) (17) 


Multiply the Dirac equation (1) by y°: y°(iy"8, — m)y(x) =0 = (iy#d!, — 
m)y v(x), where a7, = 9/3x". Thus, 


v(x) = ny w(x) (18) 


satisfies the Dirac equation in the space-reflected world (where 7 is an arbitrary 
phase that we can set to 1). 

Note, for example, p(x’) (x') = P(x) p(x) but Way? y'(x') = vo)y"y? 
yow(x) = —v(x)y5y(x). Under a Lorentz transformation y(x)y°w(x) and 
w(x) (x) transform in the same way but under space reflection they transform 
in an opposite way; in other words, while y(x)y(x) transforms as a scalar, 
w(x) yw (x) transforms as a pseudoscalar. 

You are now ready to do the all-important exercises in this chapter. 


The Dirac Lagrangian 


An interesting question: What Lagrangian would give Dirac’s equation? The an- 
swer iS 


L=wi db-m)y (19) 


Since y is complex we can vary y and y independently to obtain the Euler- 
Lagrange equation of motion. Thus, 4,,(54/60,, y) — 5£/dy = 0 gives 3, (ipy”) 
+my = 0, which upon hermitean conjugation and multiplication by y® gives the 
Dirac equation (1). The other variational equation ,,(64/ 5a,) — 6L/sy =0 
gives the Dirac equation even more directly. (If you are disturbed by the asymmetric 
treatment of y and y, you can always integrate by parts in the action, have d,, act 


1 Rotations consist of all linear transformations x! —> R/x/ such that det R = +-1. Those 
transformations with det R = —1 are composed of parity followed by a rotation. In (3 + 1)- 
dimensional spacetime, parity can be defined as reversing one of the spatial coordinates 
or all three spatial coordinates. The two operations are related by a rotation. Note that in 
odd dimensional spacetime, parity is not the same as space inversion, in which all spatial 
coordinates are reversed (see Exercise IT.1.11). 
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on w in the Lagrangian, and then average the two forms of the Lagrangian. The 
action S = f d‘xL treats y and y symmetrically.) 


Slow and fast electrons 


Given a set of gamma matrices it is straightforward to solve the Dirac equation 
(p—m)v(p)=90 (20) 


for ¥(p): It is a simple matrix equation (see Exercise II.1.3). 

Note that if somebody uses the gamma matrices y“, you are free to use instead 
y'# = W-!y#W with W any 4 by 4 matrix with an inverse. Obviously, y also 
satisfy the Clifford algebra. This freedom of choice corresponds to a simple change 
of basis. Physics cannot depend on the choice of basis, but which basis is the most 
convenient depends on the physics. 

For example, suppose we want to study a slowly moving electron. Let us use 
the basis defined by (3) and (4), and the 2-component decomposition of y (13). 
Since (6) tells us that x (p) = 0 for an electron at rest, we expect x (p) to be much 
smaller than ¢(p) for a slowly moving electron. 

In contrast, for momentum much larger than the mass, we can approximate (20) 
by pw(p) =0. Multiplying on the left by y>, we see that if ¥(p) is a solution then 
yy (p) is also a solution since y> anticommutes with y“. Since (y°)? = 1, we 
can form two projection operators P; = 1(1— y>) and Pp = 3(1+ y>), satisfying 
P? = P,, P? = Pp, and P; Pp = 0. It is extremely useful to introduce the two 
combinations yw, = 5(1 — y>)v and p= s(1+ y>)y. Note that yw, =—vW, 
and y>Wp = +p. Physically, a relativistic electron has two degrees of freedom: it 
can spin either clockwise or anticlockwise around the direction of motion. I leave 
it to you as an exercise to show that y, and wp correspond to precisely these two 
possibilities. The subscripts ZL and R indicate left and right handed. Thus, for fast 
moving electrons, a basis known as the Wey] basis, designed so that y°, rather than 
y°, is diagonal, is more convenient. Instead of (3), we choose 


yr=(‘ safer (21) 


We keep y’ as in (4). This defines the Wey] basis. We now calculate 


y sip yly?y? =i ®@ a) '0°o? @Pm) = —T ®t) = ( S 9) (22) 
which is indeed diagonal as desired. The decomposition into left and right handed 
fields is of course defined regardless of what basis we feel like using, but in the 
Weyl basis we have the nice feature that y, has two upper components and yp 
has two lower components. The spinors yy, and wp are known as Weyl spinors. 

Note that in going from the Dirac to the Weyl basis y° and y° trade places (up 
to a sign): 
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Dirac :y°diagonal; Weyl :y°diagonal. (23) 


Physics dictates which basis to use: We prefer to have y° diagonal when we deal 
with slowly moving spin 5 particles, while we prefer to have y> diagonal when we 
deal with fast moving spin 5 particles . 

I note in passing that if we define o” = (J, o) anda" = (I, —c) we can write 


more compactly in the Weyl basis. (We develop this further in Appendix E.) 


Chirality or handedness 


Regardless of whether a Dirac field y (x) is massive or massless, it is enormously 
useful to decompose y into left and right handed fields yr (x) = Ww, (x) + Wr) = 
11 — py) (x) + 41 + y°) (x). As an exercise, show that you can write the 
Dirac Lagrangian as 


L=W A—mv =i Avi + Vpi AvR—MVivR+ rv) (24) 


The kinetic energy connects left to left and right to right, while the mass term 
connects left to right and right to left. 

The transformation y — e!’y leaves the Lagrangian © invariant. Applying 
Noether’s theorem, we obtain the conserved current associated with this symmetry 
J# =wy"w. Projecting into left and right handed fields we see that the they 
transform the same way: y, > ey, and yp — eo vp. 

If m =0, & enjoys an additional symmetry, known as a chiral symmetry, 
under which y — e!? Yay. Noether’s theorem tells us that the axial current J># = 
py" yw is conserved. The left and right handed fields transform in opposite ways: 
v,— e Py, and Wp — e'? yp. These points are particularly obvious when £ is 
written in terms of yr, and yp, as in (24). 

In 1956 Lee and Yang proposed that the weak interaction does not preserve 
parity. It was eventually realized (with these four words I brush over a beautiful 
chapter in the history of particle physics; I urge you to read about it!) that the weak 
interaction Lagrangian has the generic form 


L=Gwiy vo War Y War (25) 


where Wf, 23,4 denotes four Dirac fields and G the Fermi coupling constant. This 
Lagrangian clearly violates parity: Under a spatial reflection, left handed fields are 
transformed into right handed fields and vice versa. 

Incidentally, henceforth when I say a Lagrangian has a certain form, I will 
usually indicate only one or more of the relevant terms in the Lagrangian, as in 
(25). The other terms in the Lagrangian, such as W,(i 2 — m,)yy, are understood. 
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If the term is not hermitean, then it is understood that we also add its hermitean 
conjugate. 


Interactions 


As we saw in (25) given the classification of bilinears in the spinor field you worked 
out in an exercise it is easy to introduce interactions. As another example, we can 
couple a scalar field y to the Dirac field by adding the term ggwy (with g some 
coupling constant) to the Lagrangian £ = (i f — m)y (and of course also adding 
the Lagrangian for g). Similarly, we can couple a vector field A,, by adding the 
termeA uW yw. We note that in this case we can introduce the covariant derivative 
D, = 9, —ieA, andwriteL =~ J-—m)y +eA, wy" =viy"D, —my. 
Thus, the Lagrangian for a Dirac field interacting with a vector field of mass pz reads 


L=(iy"D, —m)v — Fy FY’ — 5y7A, AY (26) 


If the mass yz vanishes, this is the Lagrangian for quantum electrodynamics. 
Varying with respect to y, we obtain the Dirac equation in the presence of an 
electromagnetic field: 


liy*(@, —ieA,) —m]y =0 (27) 


Charge conjugation and antimatter 


With coupling to the electromagnetic field, we have the concept of charge and 
hence of charge conjugation. Let us try to flip the charge e. Take the complex 
conjugate of (27): [—iy**(0,, + ieA,,) — m]y* = 0. Complex conjugating (2) we 
see that the —y* also satisfy the Clifford algebra and thus must be the y“ matrices 
expressed in a different basis, that is, there exists a matrix C y® (the notation with an 
explicit factor of y° is standard; see below) such that —y#* = (Cy®)!y#(Cy®). 
Plugging in, we find that 


liy"(@, +ieA,) — my. =0 (28) 


where we have defined yw, = Cy°w*. Thus, if w is the field of the electron, then 
y, is the field of a particle with a charge opposite to that of the electron but with 
the same mass, namely the positron. 

The discovery of antimatter was one of the most momentous in twentieth- 
century physics. We will discuss antimatter in more detail in the next chapter. 

It may be instructive to look at the specific form of the charge conjugation matrix 
C. We can write the defining equation for C as Cy°y#*y°C7—! = —y. Complex 
conjugating the equation (y#)! = y°y#y°, we obtain (y#)? = p9y#*y® if y° is 
real. Thus, 


- (yh =-C7ly#C (29) 
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which explains why C is defined with a y° attached. 

In both the Dirac and the Wey] bases ? is the only imaginary gamma matrix. 
Then the defining equation for C just says that Cy® commutes with y? but 
anticommutes with the other three y matrices. So evidently C = y7y° [up to an 
arbitrary phase not fixed by (29)] and indeed y7y“*y? = y#. Note that we have 
the simple (and satisfying) relation 


bear" (30) 


You can easily convince yourself (Exercise 8) that the charge conjugate of a left 
handed field is right handed and vice versa. As we will see later, this fact turns out 
to be crucial in the construction of grand unified theory. Experimentally, itis known 
that the neutrino is left handed. Thus, we can now predict that the antineutrino is 
right handed. 

Furthermore, 7, transforms as a spinor. Let’s check: Under a Lorentz transfor- 
mation y > e~@/%#.7"" yy, complex conjugating we have y* > et @/Meus(o")" 
w*; hence py. > p2et @ Menor y* = e-E/Veuwe" y [Recall from (10) that 
o” is defined with an explicit 7.] 

Note that C7 = y°y? = —C in both the Dirac and the Wey] bases. 


Majorana neutrino 


Since y, transforms as a spinor, Majorana” noted that Lorentz invariance allows 
not only the Dirac equation 18 = my but also the Majorana equation 


ipy=my, G1) 


Complex conjugating this equation and multiplying by y*, we have 
—y7iy* da," = y?m(—y*)y, that is, Ay, = my. Thus, —d°p = ip(isy) = 
ifmy, =m. As we anticipated, m is indeed the mass, known as a Majorana 
mass, of the particle associated with y. 

The Majorana equation (31) can be obtained from the Lagrangian? 


L=pipy — pmb Cw + ocy") (32) 


upon varying y. 
Since y and y, carry opposite charge, the Majorana equation, unlike the Dirac 
equation, can only be applied to electrically neutral fields. However, as y, is right 


2 Ettore Majorana had a brilliant but tragically short career. While still in his twenties, he 
disappeared off the coast of Sicily during a boat trip. The precise cause of his death remains 
a mystery. 

3 Upon recalling that C is antisymmetric, you may have worried that $7 Cy = Cygvyvg 
vanishes. In future chapters we will learn that y has to be treated as anticommuting 
“Grassmannian numbers.” 
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handed if y is left handed, the Majorana equation, again unlike the Dirac equation, 
preserves handedness. Thus, the Majorana equation is almost tailor made for the 
neutrino. 

From its conception the neutrino was assumed to be massless, but couple of 
years ago experimentalists established that it has a small but nonvanishing mass. 
As of this writing, it is not known whether the neutrino mass is Dirac or Majorana. 
We will see in Chapter VII.7 that a Majorana mass for the neutrino arises naturally 
in the SO(10) grand unified theory. 

Finally, there is the possibility that y = y,, in which case y is known as a 
Majorana spinor. 


Time reversal 


Finally, we come to time reversal,* which as you probably know, is much more 
confusing to discuss than parity and charge conjugation. In a famous 1932 paper 
Wigner showed that time reversal is represented by an antiunitary operator. Since 
this peculiar feature already appears in nonrelativistic quantum physics, it is in 
some sense not the responsibility of a book on relativistic quantum field theory to 
explain time reversal as an antiunitary operator. Nevertheless, let me try to be as 
clear as possible. I adopt the approach of “letting the physics, namely the equations, 
lead us.” 

Take the Schrédinger equation i(0/dt)¥(t) = H(t) (and for definiteness, 
think of H = —(1/2m)V? + V(X), just simple one particle nonrelativistic quantum 
mechanics.) We suppress the dependence of YW on x. Consider the transformation 
t > t/ =—1t. We want to find a W(t’) such that i(0/dr’)W'(t’) = HW'(t’). Write 
w(t’) = T(t), where T is some operator to be determined (up to some arbitrary 
phase factor 7). Plugging in, we have i[0/0(—O)]T V(t) = HT V(t). Multiply 
by T~|, and we obtain T~'(-i)T (0/02) W(t) = T~'HT W(t). Since H does not 
involve time in any way, we want T~'H = HT~!. Then T~!(—i)T (8/02) V(t) = 
HW (t). We are forced to conclude, as Wigner was, that 


T \(-i)T =i (33) 


Speaking colloquially, we can say that in quantum physics time goes with ani and 
so flipping time means flipping 7 as well. 

Let T =U K, where K complex conjugates everything to its right. Then T= 
KU! and (33) holds if U-'4U =i, that is, if VU! is just an ordinary (unitary) 
operator that does nothing to i. We will determine U as we go along. The presence 
of K makes T “antiunitary.” 


4 Incidentally, I do not feel that we completely understand the implications of time- 
reversal invariance. See A. Zee, “Night thoughts on consciousness and time reversal,” in: 
Art and Symmetry in Experimental Physics: pp. 246-249 . 
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We check that this works for a spinless particle in a plane wave state Y(t) = 
e(ki-E) Plugging in, we have W(t’) = T(t) = UKW(t) = UV*(t) 
= Ye iki-Et). since W has only one component, U is just a phase factor® n 
that we can choose to be 1. Rewriting, we have '(t) = e HRE+E) — gi(“k-x—Et) 
Indeed, W’ describes a plane wave moving in the opposite direction. Crucially, 
W’(t) « e7!£! and thus has positive energy as it should. Note that acting on a 
spinless particle T? = UKU K = UU*K* = +1. 

Next consider a spin } nonrelativistic electron. Acting with T on the spin-up 
state ( ) we want to obtain the spin-down state ( 9 ) . Thus, we need a nontrivial 


matrix U = no, to flip the spin: 


r(o)=¥(o)=" (4) 


Similarly, T acting on the spin-down state produces the spin-up state. Note that 
acting on a spin 4 particle 


T? = noyK no,K = nopn*ox KK =-1 


This is the origin of Kramer’s degeneracy: In a system with an odd number of 
electrons in an electric field, no matter how complicated, each energy level is 
twofold degenerate. The proof is very simple: Since the system is time reversal 
invariant, Y and TW have the same energy. Suppose they actually represent the 
same state. Then TW = e!“W, but then T2W = T(TW) = Te? WV =e "TU = 
w A -—W.So WV and TW must represent two distinct states. 

All of this is beautiful stuff, which as I noted earlier you could and should 
have learned in a decent course on quantum mechanics. My responsibility here 
is to show you how it works for the Dirac equation. Multiplying (1) by y° from 
the left, we have i(8/dt) y(t) = Hw(t) with H = —iy°y'd; + y°m. Once again, 
we want i(0/dr) y'(t’) = HW'(t) with wy’) = Ty(t) and T some operator 
to be determined. The discussion above carries over if T~'HT = H, that is, 
KU~'HUK = H. Thus, we require KU7!y°U K = y® and KU" Gy ®y)UK = 
iy°y'. Multiplying by K on the left and on the right, we see that we have to solve 
for aU such that U~!y°U = y™ and U—!y'U = —y'*. We now restrict ourselves 
to the Dirac and Wey] bases, in both of which y? is the only imaginary guy. Okay, 
what flips y! and y? but not y® and y?? Well, U = ny!y? (with 7 an arbitrary 
phase factor) works: 


vt) =ny 'yPKY®) (34) 


Since the y!’s are the same in both the Dirac and the Wey] bases, in either we have 
from (4) 


5It is a phase factor rather than an arbitrary complex number because we require that 
[WP = |W. 
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U =n(o! @it)(0? @it,) =nio* @1 


As we expect, acting on the 2-component spinors contained in 7, the time reversal 
operator T involves multiplying by io. Note also that as in the nonrelativistic case 
Tey =—-y. 

It may not have escaped your notice that y° appears in the parity operator (18), 
y” in charge conjugation (30), and y'y? in time reversal (34). If we change a Dirac 
particle to its antiparticle and flip spacetime, y> appears. 


CPT theorem 


There exists a profound theorem stating that any local Lorentz invariant field 
theory must be invariant under® CPT, the combined action of charge conjugation, 
parity, and time reversal. The pedestrian proof consists simply of checking that any 
Lorentz invariant local interaction you can write down [such as (25)], while it may 
break charge conjugation, parity, or time reversal separately, respects CPT. The 
more fundamental proof involves considerable formal machinery that I will not 
develop here. You are urged to read about the phenomenological study of charge 
conjugation, parity, time reversal, and CPJ, surely one of the most fascinating 
chapters in the history of physics.’ 


Two stories 


I end this chapter with two of my favorite physics stories—one short and one long. 

Paul Dirac was notoriously a man of few words. Dick Feynman told the story 
that when he first met Dirac at a conference, Dirac said after a long silence, “J have 
an equation; do you have one too?” 

Enrico Fermi did not usually take notes, but during the 1948 Pocono conference 
(see Chapter 1.7) he took voluminous notes during Julian Schwinger’s lecture. 
When he got back to Chicago, he assembled a group consisting of two professors, 
Edward Teller and Gregory Wentzel, and four graduate students, Geoff Chew, 
Murph Goldberger, Marshall Rosenbluth, and Chen-Ning Yang (all to become 
major figures later). The group met in Fermi’s office several times a week, a couple 
of hours each time, to try to figure out what Schwinger had done. After 6 weeks, 
everyone was exhausted. Then someone asked, “Didn’t Feynman also speak?” The 
three professors, who had attended the conference, said yes. But when pressed, 


° A rather pedantic point, but potentially confusing to some students, is that I distinguish 
carefully between the action of charge conjugation C and the matrix C: Charge conjugation 
€ involves taking the complex conjugate of y and then scrambling the components with 
Cy’®. Similarly, I distinguish between the operation of time reversal J and the matrix 7. 


See, e.g., J. J. Sakurai, Invariance Principles and Elementary Particles and E. D. 


Comming, Weak Interactions. 
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not Fermi, nor Teller, nor Wentzel could recall what Feynman had said. All they 
remembered was his strange notation: p with a funny slash through it. 


1.1.1. 


11.1.2. 


11.1.3. 


11.1.4. 


1.1.10. 


1.1.11. 


Exercises 


Show that the following bilinears in the spinor field yw, py“y, po’, 
vy"y>y, and yy>y transform under the Lorentz group and parity as a scalar, a 
vector, a tensor, a pseudovector Or axial vector, and a pseudoscalar, respectively. 
(Hint: For example, vy“y°?y > Wl1+ G/4ealy"y>[1 — @/4)ea]W under an 
infinitesimal Lorentz transformation and > wy°y“y>y°y under parity. Work 
out these transformation laws and show that they define an axial vector. 


Write all the bilinears in the preceding exercise in terms of yr; and yp. 
Solve ( p — m)y(p) = 0 explicitly (by rotational invariance it suffices to solve it 
for p along the 3rd direction, say). Verify that indeed x 1s much smaller than ¢ 


for a slowly moving electron. What happens for a fast moving electron? 


Exploiting the fact that y is much smaller than ¢ for a slowly moving electron, 
find the approximate equation satisfied by ¢. 


. For a relativistic electron moving along the z-axis, perform a rotation around the 


z-axis. In other words, study the effect of e~“/°"" on w(p) and verify the 
assertion in the text regarding y, and yp. 


. Show explicitly that (25) violates parity. 


. The defining equation for C evidently fixes C only up to an overall constant. Show 


that this constant is fixed by requiring (y¥,.). = wv. 


.8. Show that the charge conjugate of a left handed field is right handed and vice 


veTsa, 


. Show that Cy is a Lorentz scalar. 


Work out the Dirac equation in (1 + 1)-dimensional spacetime. 


Work out the Dirac equation in (2 + 1)-dimensional spacetime. Show that the 
apparently innocuous mass term violates parity and time reversal. [Hint: The three 
y’s are just the three Pauli matrices with appropriate factors of i .] 


8. N. Yang, Lecture at the Schwinger Memorial Session of the American Physical 
Society meeting in Washington D. C., 1995. 


Chapter [1.2 


Quantizing the Dirac Field 


Anticommutation 


We will use the canonical formalism of Chapter I.8 to quantize the Dirac field. 

Long and careful study of atomic spectroscopy revealed that the wave function 
of two electrons had to be antisymmetric upon exchange of their quantum numbers. 
It follows that we cannot put two electrons into the same energy level so that they 
will have the same quantum numbers. In 1928 Jordan and Wigner showed how this 
requirement of an antisymmetric wave function can be formalized by having the 
creation and annihilation operators for electrons satisfy anticommutation rather 
than commutation relations as in (1.8.12). 

Let us start out with a state with no electron |0) and denote by bt the operator 
creating an electron with the quantum numbers a. In other words, the state bt iO) 
is the state with an electron having the quantum numbers a. Now suppose we want 
to have another electron with the quantum numbers f, so we construct the state 
b,bt |O). For this to be antisymmetric upon interchanging a and 8 we must have 


{b), by} = bib) + bhbi =0 (1) 


Upon hermitean conjugation, we have {b,,, bg} = 0. In particular, b} bt = 0, so that 
we cannot create two electrons with the same quantum numbers. 
To this anticommutation relation we add 


{ba bp) = Sap 2) 


One way of arguing for this is to say that we would like the number operator to 
be N = }°,, b'b,, just as in the bosonic case. Show with one line of algebra that 
[AB. C])=A[B, C]+ [A, C]B or [AB, C] = A{B, C} — {A, C}B. (A heuristic 
way of remembering the minus sign in the anticommuting case is that we have 
to move C past B in order for C to do its anticommuting with A.) For the 
desired number operator to work we need [}-, bt by, bp = +b, (so that as usual 


N |0) = 0), and Nb} |0) =F {0)) and so we have (2). 
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The Dirac field 
Let us now turn to the free Dirac Lagrangian 
L=wig —m)v (3) 


The momentum conjugate to ¥ is 7, = 6£/60,%, =i t,. We anticipate that the 
correct canonical procedure requires imposing the anticommutation relation: 


~ to ~ 
{Wal®, 1), ¥g(0, t)} = 5 R) bag (4) 
We will derive this below. 
The Dirac field satisfies 
GJ —m)y =0 (5) 


Plugging in plane waves u(p, s)e—'P* and u(p, s)e'?* for yr, we have 


(p—m)u(p,s)=0 (6) 


and 
(p+m)v(p,s)=0 (7) 


The index s = +1 reminds us that each of these two equations has two solutions, 
spin up and spin down. Evidently, under a Lorentz transformation the two spinors 
u and v transform in the same way as ¥. Thus, if we define 4 =u'y® and i= 
v'y°, then Zu and vv are Lorentz scalars. 

This subject is full of “peculiar” signs and so I will proceed very carefully and 
show you how every sign makes sense. 

First, since (6) and (7) are linear we have to fix the normalization of u and v. 
Since u(p, s)u(p, s) and u(p, s)v(p, 5) are Lorentz scalars, the normalization 
condition we impose on them in the rest frame will hold in any frame. 

Our strategy is to do things in the rest frame using a particular basis and then 
invoke Lorentz invariance and basis independence. In the rest frame, (6) and 
(7) reduce to (y° — 1)u = 0 and (yv° + 1)v = 0. In particular, in the Dirac basis 
y= ( i 5, ) , so the two independent spinors u (labeled by spin s = +1) have 


the form 


and 


OOo Oo re 
OOo r& 


while the two independent spinors v have the form 
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0 0 
0 0 
1 | 24 | 9 
1 


0 


The normalization conditions we have implicitly chosen are then z(p, s)u(p, s) = 
1] and v(p, s)v(p, s) = —1. Note the minus sign thrust upon us. Clearly, we also 
have the orthogonality condition uv = 0 and vu = 0. Lorentz invariance and basis 
independence then tell us that these four relations hold in general. 

Furthermore, in the rest frame 


1 
Y walP. vir. 9)= (4 9) = 507" + Dag 
ap 


5 


and 
1 
DvP. ipl. )=(9 2) =50°- dep 
aB 


Thus, in general 


- _f{ ptm 
dX Ug(P, S)ig(p, 8) = ( om )., (8) 
and 
_ p-—m 
dX Ug(P. 5)Ug(P, 5) = ( on )., (9) 


Another way of deriving (8) is to note that the left hand side is a 4 by 4 matrix (it is 
like a column vector multiplied by a row vector on the right) and so must be a linear 
combination of the sixteen 4 by 4 matrices we listed in Chapter II.1. Argue that y> 
and y“y° are ruled out by parity and that o“” is ruled out by Lorentz invariance 
and the fact that only one Lorentz vector, namely p”, is available. Hence the right 
hand side must be a linear combination of p and m. Fix the relative coefficient by 
acting with » — m from the left. The normalization is fixed by setting a = 6 and 
summing over a. Similarly for o- In particular, setting a = B and summing over 
a, we recover U(p, s)v(p, s) = 

We are now ready to promote v ix) to an operator. In analogy with (1.8. 11) we 
expand the field in plane waves! 


v(x) = 


a 
[2 Llp. su(p, se* + d'(p, s)u(p, sye?*] (10) 
(27)2(E,/m)? 


'The notation is standard. See e.g., J. A. Bjorken and S. D. Drell, Relative Quantum 
Mechanics. 
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(Here E, = pp=+Vp?+m* and px = p,x".) The normalization factor 
(E,/ m)? is slightly different from that in (1.8.11) for reasons we will see. Oth- 
erwise, the rationale for (10) is essentially the same as in (1.8.11). We integrate 
over Momentum p, sum over spin s, expand in plane waves, and give names to the 
coefficients in the expansion. Because yf is complex, we have, just as in Exercise 
18.3, but unlike (1.8.11), a b operator and a d? operator. 

Just as in Exercise 1.8.3, the operators b and d‘ must carry the same charge. 
Thus, if 6 annihilates an electron with charge e = —|e|, d ¥ must remove charge e; 
that is, it creates a positron with charge —e = |e|. 

A word on notation: in (10) b(p, s), d'(p, Ss), u(p, s), and v(p, s) are written 
as functions of the 4-momentum p but strictly speaking they are functions of p 
only, with p° always understood to be +./ p? + m?. 

Thus let b'( p, s) and b(p, s) be the creation and annihilation operators for an 
electron of momentum p and spin s. Our introductory discussion indicates that we 
should impose 


{b(p, 8), b'(p", s')} = 8 OB — By (11) 
{b(p, s), b(p’, s‘)} =0 (12) 
{b'(p. 8), b'(p', 8} =0 (13) 


There is a corresponding set of relations for d'(p, s) and d(p, s) the creation and 
annihilation operators for a positron, for instance, 


{d(p,s), d"(p’, s')}=8O@B — pss (14) 
We now have to show that we indeed obtain (4). Write 
} d*p' tral Aarn! of PY Nnogyf of 
(0) = | ——— rr, sar’ s+ dr, 8000", 8‘) 
(27) 2 (E p/m)? “y 
Nothing to do but to plow ahead: 
(wz, 0), ¥0)} 


d>p 


7 (21)3(E,/m) due, s)a(p, s)e'?* + u(p, s)i(p, sel? *] 
P 


5 


if we take b and b' to anticommute with d and d‘. Using (8) and (9) we obtain 


-q ama f —2P _ “iB Sf — poi t 
WE.0, HO) = | Esl me + (p—meiP* 


=| d°p apy eiPF = 93) (3) 
(21)*(2Ep) 
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which is just (4) slightly disguised. 

Similarly, writing schematically, we have {w, w} = 0 and (yr, wr} = 0. 

The normalization factor (E,/m)? in (10) can of course be modified by ab- 
sorbing appropriate factors into u and v. Sometimes an astute student asks what 
happens with a massless spin } particle. We could use the normalization factor 


(QE »)? [bringing it in line with the normalization factor in (1.8.11)] and at the 
same time changing (8) and (9) to }’ uu = pand >> vi= Dp. 
s s 


Energy of the vacuum 


An important exercise at this point is to calculate the Hamiltonian starting with the 
Hamiltonian density 


Hanh Lewy -5+my (15) 


Inserting (10) into this expression and integrating, we have 


H= | PxH = | Oxp(iy -dI+m)yv = | Pxpiy’ (16) 
which works out to be 
H= | &p > E,lb (p,s)b(p, s) — d(p,s)d (p, s)] (17) 


We can see the all important minus sign in (17) schematically: In (16) ¥ gives 
a factor ~ (b’ +d), while 0/dt acting on w brings down a relative minus sign 
giving ~ (b — d'), thus giving us ~ (b' + d)(b— d') ~b'b —dd' (orthogonality 
between spinors vu = 0 kills the cross terms). 

To bring the second term in (17) into the right order, we anticommute 


—d(p, s)d'(p, s) =d'(p, s)d(p, 5) — 890) so that 


t= | pd Eb @. Nbp.s) +4, 9d. 9) 


| - 396) [ dp DE, (18) 


The first two terms tell us that each electron and each positron of momentum p 
and spin s has exactly the same energy E,,, as it should. But what about the last 
term? That 5°) (0) should fill us with fear and loathing. 

It is OK: Noting that 8p) = [1/(2m)3] f d3xe'P?, we see that 8°) (0) = 
[1/ (27)7] f d*x (we encounter the same maneuver in Exercise 1.8.2) and so the 
last term contributes to H 
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y= —3 fas | rp D2GE,) (19) 


(since in natural units we have fi = | and hence h = 27). We have an energy 
—4£, in cach unit-size phase-space cell (1 /h>)d>x d?p in the sense of statistical 
mechanics, for each spin and for the electron and positron separately (hence the 
factor of 2) . This infinite additive term Ep is precisely the analog of the zero 
point energy 4/w of the harmonic oscillator you encountered in your quantum 
mechanics course. But it comes in with a minus sign! 

The sign is bizarre and peculiar! Each mode of the Dirac field contributes — fiw 
to the vacuum energy. In contrast, each mode of a scalar field contributes fia as 
we saw in chapter I.8. This fact is of crucial importance in the development of 
supersymmetry, which we will discuss in Chapter VIII.4. 


Fermion propagating through spacetime 


In analogy with (1.8.14), the propagator for the electron is given by iSyg(x) = 
(0 T Ua(x)¥ip (0) |0), where the argument of y has been set to O by translation 
invariance. As we will see, the anticommuting character of y requires us to define 
the time-ordered product with a minus sign, namely 


T YX) O) = 0) ¥@)EO) — O(-x) FOV) (20) 


Referring to (10), we obtain for x9 >0, 


iS(x) = (01 vie¥0) 0) = f aotegm DM s)i(p, sye* 


p + Mm —ipx 
(20)3(E,/m) 2m 


For x° < 0, we have to be a bit careful about the spinorial indices: 


iSyp(x) = —(O| owe [0) 


TA-\3(/E Jy —ipx 
--| ones /m) d tg(p, S)vy(P, s)e 


_ eee p-—m —ipx 
| Qay(E,/m) am wb? 


using the identity (9). 
Putting things together we obtain 


iS(x) = 


fen e Px _ 0 p-m vor] 
| aterm [2 Fm 6 OR | AD 
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We will now show that this fermion propagator can be written more elegantly 
as a 4-dimensional integral: 


4 4 
iS(x) =i | OP oipx_Ptm _ =| d'p evipx___ (22) 
(2m)4 p?—m? +ie (2n)4 pomtie 


To show that (22) is indeed equivalent to (21) we go through essentially the 
same steps as after (1.8.14). In the complex p° plane the integrand has poles at 
p° =+4,/ p? +m? —ie ~ 4(E, — ie). For x° > 0 the factor e?P"x" tells us to 
close the contour in the lower half-plane. We go around the pole at +(E, — ié) 
clockwise and obtain 


. _ vn: dp —ip-x pom 
iS(x) = ( ai | Pie “IE, 


producing the first term in (21). For x° < 0 we are now told to close the contour in 


the upper half-plane and thus we go around the pole at —(E, — ig) anticlockwise. 
We obtain 


d*p sin sige 1 +5 
iS(x) =i? OP otiE px tipi (—E,y° — By +m) 


(2x) —2E, 
and flipping p we have 
isey=— [ 2 cies | (ey? — pp —my =— [SP ee p—m 
Qn) 2B, * (Qx)3 _2E, 


precisely the second term in (21) with the minus sign and all. Thus, we must define 
the time-ordered product with the minus sign as in (20). 

After all these steps, we see that in momentum space the fermion propagator 
has the elegant form 


i 
iS(p) = ————_ 23 
P) p-—mt+tie (23) 
This makes perfect sense: $(p) comes out to be the inverse of the Dirac operator 
p — m, justas the scalar boson propagator D(k) = 1/ (k? — m? + ig) is the inverse 
of the Klein-Gordon operator k* — m?. 


Poetic but confusing metaphors 


In closing this chapter let me ask you some rhetorical questions. Did I speak of an 
electron going backward in time? Did I mumble something about a sea of negative 
energy electrons? This metaphorical language, when used by brilliant minds, the 
likes of Dirac and Feynman, was evocative and inspirational, but unfortunately 
confused generations of physics students and physicists. The presentation given 


~ 
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here is in the modern spirit, which seeks to avoid these potentially confusing 
metaphors. 


Exercises 
11.2.1. Use Noether’s theorem to derive the conserved current J“ = wy". Calculate 
[Q, w}, thus showing that b and d‘ must carry the same charge. 


I1.2.2. Quantize the Dirac field in a box of volume of V and show that the vacuum energy 
Ep is indeed proportional to V. [Hint: The integral over momentum f d 3p is 
replaced by a sum over discrete values of the momentum. } 


Chapter II.3 
Lorentz Group and Weyl Spinors 


The Lorentz algebra 


In Chapter II.1 we followed Dirac’s brilliantly idiosyncratic way of deriving his 
equation. We develop here a more logical and mathematical theory of the Dirac 
spinor. A deeper understanding of the Dirac spinor not only gives us a certain satis- 
faction, but is also indispensable, as we will see later, in studying supersymmetry, 
one of the foundational concepts of superstring theory; and of course, most of 
the fundamental particles such as the electron and the quarks carry spin 5 and are 
described by spinor fields. 

Let us begin by reminding ourselves how the rotation group works. The three 
generators J; (i = 1,2, 3 or x, y, z) of the rotation group satisfy the commutation 
relation 


Li, Jil =tecjde (1) 


When acting on the spacetime coordinates, written as a column vector 


0 


oe te 


1 
2 
3 


the generators of rotations are represented by the hermitean matrices 


000 0 
000 0 
A=l9 00 -i 2) 
00: 0 


with J, and J; obtained by the appropriate permutations. You should verify by 
laboriously multiplying these three matrices that (1) is satisfied. Note that the signs 
of J; are fixed by the commutation relation (1). 

Now add the Lorentz boosts. A boost in the x =x! direction transforms the 
spacetime coordinates: 


t' = (cosh gv) t+ (sinh 9) x; x’ = (sinh g) t+ (cosh ¢) x (3) 
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or for infinitesimal g 
t=t+ox; x=x+ot (4) 


In other words, the infinitesimal generator of a Lorentz boost in the x direction is 
represented by the hermitean matrix (x° = ¢ as usual) 


010 0 
100 0 

Mi=19 0 0 0 (5) 
00 0 0 

Similarly, 

0010 
000 0 

=11 0 0 0 6) 
000 0 


I leave it to you to write down K3. 

Check that [J;, K ;]=i¢€;;,K, . This just says that the boost generators K; 
transform as a 3-vector under rotation, as you would expect. 

You are now about to do one of the most significant calculations in the history 
of twentieth century physics. By brute force compute [K,, K>]. You will discover 
that this is equal to —i J;. Two Lorentz boosts produce a rotation! (You might recall 
from your course on electromagnetism that this mathematical fact is responsible 
for the physics of the Thomas precession.) 

Mathematically, the generators of the Lorentz group satisfy the following alge- 
bra [known to the cognoscenti as SO(3, 1) J: 


Ji, J = iid (7) 
[J;, Kj] = ie;;, Ky (8) 
[K;, K ]= —tejj. J (9) 


Note the all-important minus sign! 

How do we study this algebra? The crucial observation is that the algebra falls 
apart into two pieces if we form the combinations J,; = 4(J; +1K;). You should 
check that 


[Fyis Jy jl Stee J an (10) 
[J_i, J_ l= tip J_5 (11) 
and most remarkably 


[Jai J_jJ=0 (12) 
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This last commutation relation tells us that J, and J_ form two separate 
SU (2) algebras. The mathematically sophisticated say that the algebra 5O(3, 1) 
is isomorphic to SU(2) @ SU (2). 


From algebra to representation 


This means that we can simply use what we have already learned about the 
representation of SU(2) to determine all the representations of SO(3, 1). As 
you know, the representations of SU (2) are labeled by j = 0, 4, 1, 3,---. We 
can think of each representation as consisting of (27 + 1) objects w,, with m = 
—j,-jt+l,---, 7-1, 7 that transform into each other under SU (2). It follows 
immediately that the representations of SO (3, 1) are labeled by (j*, j~) with j* 
and j~ each taking on the values 0, }, 1, 3, ... . Each representation consists of 
(27+ + 1)(2j— +1) objects Wy. 4,,- With mt =—jt, jt +1,..., 74-1, 77 
andm” =—j°,—-—j +1,...,j°>-Lyj-. 

Thus, the representations of SO(3, 1) are (0,0), (4,0), ©, 4), (1, 0), ©, 1), G, 
4), and so on, in order of increasing dimension. We recognize the 1-dimensional 
representation (0, 0) as clearly the trivial one, the Lorentz scalar. By counting 
dimensions, we expect that the 4-dimensional representation (4, 1) has to be the 
Lorentz vector, the defining representation of the Lorentz group (see Exercise 
HI.3.1). 


Spinor representations 


What about the representation (4, 0)? Let us write the two objects as %, with 
a = 1, 2. Well, what does the notation (4, 0) mean? It says that J,; = 4(J; +iK;) 
acting on Wy is represented by 40; while J_; = 4(J, —iK;) acting on yy is 
represented by 0. By adding and subtracting we find that 


J; = 36; (13) 
and 
iK; = $9; (14) 


where the equal sign means “represented by” in this context. (By convention we do 
not distinguish between upper and lower indices on the 3-dimensional quantities 
J;, K;, and o;.) 

Similarly, let us denote the two objects in (0, 1) by the peculiar symbol x*. 
I should emphasize the trivial but potentially confusing point that unlike the bar 
used in Chapter II.1, the bar on x “is a typographical element: Think of the symbol 
X as a letter in the Hittite alphabet if you like. Similarly, the symbol & bears no 
relation to a: we do not obtain & by operating on @ in any way. The rather strange 
notation is known informally as “dotted and undotted” and more formally as the 
van der Waerden notation—a bit excessive for our rather modest purposes at this 
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point but I introduce it because it is the notation used in supersymmetric physics 
and superstring theory. (Incidentally, Dirac allegedly said that he wished he had 
invented the dotted and undotted notation.) Repeating the same steps as above, you 
will find that on the representation (0, +) we have J; = 40; andi kK; = —10;.The 
minus sign is crucial. 

The 2-component spinors y,, and ¥“are called Weyl spinors and furnish per- 
fectly good representations of the Lorentz group. Why then does the Dirac spinor 
have 4 components? 

The reason is parity. Under parity, ¥ > —% and p > —7, and thus J > J 
and K — —K, and so J, <> J_. In other words, under parity the representations 
(4,0) < (0, 1), Therefore, to describe the electron we must use both of these 
2-dimensional representations, or in mathematical notation, the 4-dimensional 
reducible representation (4, 0) @ (0, 3). 

We thus stack two Weyl spinors together to form a Dirac spinor 


_ ({ ¥a 
_ ( % ) (15) 


The spinor W(p) is of course a function of 4-momentum p [and by implication 
also y,(p) and x°(p)] but we will suppress the p dependence for the time being. 
Referring to (13) and (14) we see that acting on W the generators of rotation 


- (16 0 
— 2 
i= (4 3) 


where once again the equality means “represented by,” and the generators of boost 


Note once again the all-important minus sign. 

Parity forces us to have a 4-component spinor but we know on the other hand 
that the electron has only two physical degrees of freedom. Let us go to the rest 
frame. We must project out two of the components contained in W(p,) with the 
rest momentum p, = (m, 0). With the benefit of hindsight, we write the projection 
operator as P = 4(1 — y°). You are probably guessing from the notation that y? 
will turn out to be one of the gamma matrices, but at this point, logically y° is 
just some 4 by 4 Anatrix. The condition P? = ? implies that (y°)? = 1 so that the 
eigenvalues of y° are +1. Since yy <> x% under parity we naturally guess that w, 
and x%correspond to the left and right handed fields of Chapter II.1. We cannot 
simply use the projection to set for example “to 0. Parity means that we should 
treat y, and on the same footing. We choose 


09 {01 


or more explicitly 
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0 
0 
\ 
0 


=a OOO 


1 O 
0 1 
0 0 
0 0 
(Different choices of y° correspond to the different basis choices discussed in 


Chapter II. 1.) In other words, in the rest frame y,— x¥* = 0. The projection to two 
degrees of freedom can be written as 


(y° — )W(p,) =0 (16) 


Indeed, we recognize this as just the Weyl basis introduced in Chapter II.1. 


The Dirac equation 


We have derived the Dirac equation, a bit in disguise! 

Since our derivation is based on a step-by-step study of the spinor representation 
of the Lorentz group, we know how to obtain the equation satisfied by &(p) for 
any p: We simply boost. Writing W(p) = e~!#* w(p,), we have (19K 0 eK _ 

1) W(p) = 0. Introducing the notation y“p,,/m = eK 1,0¢iGK | we obtain the 
Dirac equation 


(vy"p,, — m)¥(p) =0 (17) 


You can work out the details as an exercise. 

The derivation here represents the deep group theoretic way of looking at the 
Dirac equation: It is a projection boosted into an arbitrary frame. 

Note that this is an example of the power of symmetry, which pervades modern 
physics and this book: Our knowledge of how the electron field transforms under 
the rotation group, namely that it has spin +, allows us to know how it transforms 
under the Lorentz group. Symmetry rules! 

In Appendix E we will develop the dotted and undotted notation further for later 
use in the chapter on supersymmetry. 

In light of your deeper group theoretic understanding it is a good idea to reread 
Chapter II. 1 and compare it with this chapter. 


Exercises 
II.3.1. Show by explicit computation that (4, +) is indeed the Lorentz vector. 


11.3.2. Work out how the six objects contained in the (1, 0) and (0, 1) transform under 
the Lorentz group. Recall from your course on electromagnetism how the electric 
and magnetic fields E and B transform. Conclude that the electromagnetic field 
in fact transforms as (1, 0) © (0, 1). Show that it is parity that once again forces 
us to use a reducible representation. 
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Show that 


and. / 
ef =coshy+é-¢sinhy 


with the unit vector ¢ = ¢/g. Identifying p = m¢ sinh ¢, derive the Dirac equa- 


tion. Show that 
, 0 o 
i I 
= (“ 0 ) 


Show that a spin-3 particle can be described by a vector-spinor V,,,, namely 
a Dirac spinor carrying a Lorentz index. Find the corresponding equations of 
motion, known as the Rarita-Schwinger equations. [Hint: The object V,,,, has 16 
components, which we need to cut down to 2 - 3 + 1= 4 components.] 


Chapter I1.4 


Spin-Statistics Connection 


There is no one fact in the physical world which has a greater impact on 
the way things are, than the Pauli exclusion principle.' 


Degrees of intellectual incompleteness 


In a course on nonrelativistic quantum mechanics you learned about the Pauli 
exclusion principle” and its later generalization stating that particles with half 
integer spins, such as electrons, obey Fermi-Dirac statistics and want to stay apart, 
while in contrast particles with integer spins, such as photons or pairs of electrons, 
obey Bose-Einstein statistics and love to stick together. From the microscopic 
structure of atoms to the macroscopic structure of neutron stars, a dazzling wealth 
of physical phenomena would be incomprehensible without this spin-statistics 
rule. Many elements of condensed matter physics, for imstance, band structure, 
Fermi liquid theory, superfluidity, superconductivity, quantum Hall effect, and so 
on and so forth, are consequences of this rule. 

Quantum statistics, one of the most subtle concepts in physics, rests on the 
fact that in the quantum world, all elementary particles and hence all atoms, are 
absolutely identical to, and thus indistinguishable from, one other. It should be 
recognized as a triumph of quantum field theory that it is able to explain absolute 
identity and indistinguishability easily and naturally. Every electron in the universe 


1T. Duck and E. C. G. Sudarshan, Pauli and the Spin-Statistics Theorem, p. 21. 

? While a student in Cambridge, E. C. Stoner came to within a hair of stating the exclusion 
principle. Pauli himself in his famous paper (Zeit. f, Physik 31: 765, 1925) only claimed to 
“summarize and generalize Stoner’s idea.’ However, later in his Nobel Prize lecture Pauli 
was characteristically ungenerous toward Stoner’s contribution. A detailed and fascinating 
history of the spin and statistics connection may be found in Duck and Sudarshan, op. cit. 


3 Barly in life, I read in one of George Gamow’s popular physics books that he could 
not explain quantum statistics—all he could manage for Fermi statistics was an analogy, 
invoking Marlene Dietrich’s famous remark “I vont to be alone.”—and that one would have 
to go to school to learn about it, Perhaps this spurs me, later in life, to write popular physics 
books also. See A. Zee, Einstein’s Universe, p. x. 
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is an excitation in one and the same electron field yy. Otherwise, one might be able 
to imagine that the electrons we now know came off an assembly line somewhere 
in the early universe and could all be slightly different owing to some negligence 
in the manufacturing process. 

While the spin-statistics rule has such a profound impact in quantum mechanics, 
its explanation had to wait for the development of relativistic quantum field theory. 
Imagine a civilization that for some reason developed-quantum mechanics but has 
yet to discover special relativity. Physicists in this civilization eventually realize 
that they have to invent some rule to account for the phenomena mentioned above, 
none of which involves motion fast compared to the speed of light. Physics would 
have been intellectually unsatisfying and incomplete. 

One interesting criterion in comparing different areas of physics is their degree 
of intellectual incompleteness. 

Certainly, in physics we often accept a rule that cannot be explained until we 
move to the next level. For instance, in much of physics, we take as a given the 
fact that the charge of the proton and the charge of electron are exactly equal 
and opposite. Quantum electrodynamics by itself is not capable of explaining 
this striking fact either. This fact, charge quantization, can only be deduced by 
embedding quantum electrodynamics into a larger structure, such asa grand unified 
theory, as we will see in Chapter VII.6. (In Chapter IV.4 we will learn that the 
existence of magnetic monopoles implies charge quantization, but monopoles do 
not exist in pure quantum electrodynamics.) 

Thus, the explanation of the spin statistics connection, by Fierz and by Pauli in 
the late 1930s, and by Liiders and Zumino and by Burgoyne in the late 1950s, ranks 
as one of the great triumphs of relativistic quantum field theory. I do not have the 
space to give a general and rigorous proof* here. I will merely sketch what goes 
terribly wrong if we violate the spin-statistics connection. 


The price of perversity 


A basic quantum principle states that if two observables commute then they are 
simultaneously diagonalizable and hence observable. A basic relativistic principle 
states that if two spacetime points are spacelike with respect to each other then no 
signal can propagate between them, and hence the measurement of an observable 
at one of the points cannot influence the measurement of another observable at the 
other point. 

Consider the charge density Jy = i (y' dp — Aye" ¢) in a charged scalar field 
theory. According to the two fundamental principles just enunciated, Jo(x, t = 
0) and Jo(y, t =0) should commute for x 4 ¥. In calculating the commutator 
of Jo(x, f = 0) with Jo(¥, ft = 0), we simply use the fact that g(x, t = 0) and 


4 See I. Duck and E. C. G. Sudarshan, Pauli and the Spin-Statistics Theorem, and R. F. 
Streater and A. S. Wightman, PCT, Spin Statistics, and All That. 
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dg (x, t = 0) commute with o(y¥, t = 0) and dgy(y, t = 0), so we just move the 
field at X steadily past the field at ¥. The commutator vanishes almost trivially. 

Now suppose we are perverse and quantize the creation and annihilation oper- 
ators in the expansion (1.8.11) 


[ae * + ate (1) 


(x 1=0)= | _ ak 
“ J (27) 2a, 


according to anticommutation rules 
{a(k), a'(@)} = 8" — 4) 
and 
{ak), a)) =0 = {al &), at} 


instead of the correct commutation rules. 

What is the price of perversity? 

Now when we try to move Jo(¥, ¢ = 0) past Jy(X, t =0), we have to move the 
field at » past the field at ¥ using the anticommutator 


{p(x, t =0), oy, t =0)} 
“lh imo 
Ba yP2ir [i2x) 2a, 
(lathe! + ate), faGye4 + at Ge PY 


ak ike) + en hE) (2) 
(Qn )P 2a, 

You see the problem? In a normal scalar-field theory that obeys the spin- 
statistics connection, we would have computed the commutator, and then in the 


last expression in (2) we would have gotten (eik-@-9) _ ei E-9)) instead of 
(ei EY) 4 ek E-N)_ The integral 


D = - 
_ ak ike) _ -ik Hy 
(27) ?20, 


would obviously vanish and all would be well. With the plus sign, we get in 
(2) a nonvanishing piece of junk. A disaster if we quantize the scalar field as 
anticommuting! A spin 0 field has to be commuting. Thus, relativity and quantum 
physics join hands to force the spin statistics connection. 

It is sometimes said that because of electromagnetism you do not sink through 
the floor and because of gravity you do not float to the ceiling, and you would be 
sinking or floating in total darkness were it not for the weak interaction, which 
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regulates stellar burning. Without the spin statistics connection, electrons would 
not obey Pauli exclusion. Matter would just collapse.” 


Exercise 
11.4.1. Show that we would also get into trouble if we quantize the Dirac field with com- 


mutation instead of anticommutation rules. Calculate the commutator [J °CX, 0), 
J°(0)]. 


5 The proof of the stability of matter, given by Dyson and Lenard, depends crucially on 
Pauli exclusion. 


Chapter II.5 


Vacuum Energy, Grassmann Integrals, 
and Feynman Diagrams for Fermions 


The vacuum is a boiling sea of nothingness, full of sound and fury, 
signifying a great deal. 
—Anonymous 


Fermions are weird 


I developed the quantum field theory of a scalar field g(x) first in the path integral 
formalism and then in the canonical formalism. In contrast, I have thus far devel- 
oped the quantum field theory of the free spin-} field y(x) only in the canonical 
formalism. We learned that the spin-statistics connection forces the field operator 
w(x) to satisfy anticommutation relations. This immediately suggests something 
of a mystery in writing down the path integral for the spinor field y. In the path in- 
tegral formalism w(x) is not an operator but merely an integration variable. How 
do we express the fact that its operator counterpart in the canonical formalism 
anticommutes? 

We presumably cannot represent y as a commuting variable, as we did 9g. 
Indeed, we will discover that in the path integral formalism 7 is to be treated 
not as an ordinary complex number but as a novel kind of mathematical entity 
known as a Grassmann number. 

If you thought about it, you would realize that some novel mathematical struc- 
ture is needed. In Chapter 1.3 we promoted the coordinates of point particles q;(t) 
in quantum mechanics to the notion of a scalar field g(x, ¢). But you already know 
from quantum mechanics that a spin } particle has the peculiar property that its 
wave function turns into minus itself when rotated through 27. Unlike particle 
coordinates, half integral spin is not an intuitive concept. 


Vacuum energy 
To motivate the introduction of Grassmann-valued fields I will discuss the notion 
of vacuum energy. The reason for this apparently strange strategy will become 


clear shortly. 
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Quantum field theory was first developed to describe the scattering of photons 
and electrons, and later the scattering of particles. Recall that in Chapter I.7 while 
studying the scattering of particles we encountered diagrams describing vacuum 
fluctuations, which we simply neglected (see Fig. 1.7.7). Quite naturally, particle 
physicists considered these fluctuations to be of no importance. Experimentally, 
we scatter particles off each other. Who cares about fluctuations in the vacuum 
somewhere else? It was only in the early 1970s that physicists fully appreciated 
the importance of vacuum fluctuations. We will come back to the importance of 
the vacuum! in a later chapter. 

In Chapters 1.8 and II.2 we calculated the vacuum energy of a free scalar field 
and of a free spinor field using the canonical formalism. To motivate the use of 
Grassmann numbers to formulate the path integral for the spinor field I will adopt 
the following strategy. First, I use the path integral formalism to obtain the result 
we already have for the free scalar field using the canonical formalism. Then we 
will see that in order to produce the result we already have for the free spinor field 
we must modify the path integral. 

By definition, vacuum fluctuations occur even when there are no sources to 
produce particles. Thus, let us consider the generating functional of a free scalar 
field theory in the absence of sources:” 


4 


_ / Doe! f d4x 1[(ag)*—m?9?] —¢ (sagt =) —C ovat log(a?-+m?) (1) 


For the first equality we used (1.2.15) and absorbed inessential factors into the 
constant C’. In the second equality we used the important identity 


det M = elt los M (2) 


which you encountered in Exercise 1.10.2. 

Recall that Z = (0|e~'“7 0) (with T — oo understood so that we integrate 
over all of spacetime in (1)), which in this case is just e~/“7 with E the energy of 
the vacuum. Evaluating the trace in (1) 


TrO= J d*x(x|O |x) 


= f a's fS £4 iy (el OlgNalx) 
(ny _ 


we obtain 


'Indeed, we have already discussed one way to observe the effects of vacuum fluctua- 
tions in Chapter 1.8. 


? Strictly speaking, to render the expressions here well defined we should replace m? by 


m* — ie as discussed earlier. 
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d*k 


ony log(k? —m?+ie) +A 


iET =35VT 


where A is an infinite constant corresponding to the multiplicative factor C in 
(1). Recall that in the derivation of the path integral we had lots of divergent 
multiplicative factors; this is where they can come in. The presence of A is a good 
thing here since it solves a problem you might have noticed: The argument of the 
log is not dimensionless. Let us define m’ by writing 


a=-yvr [ ak log(k? ~ m” + ie) 
° (2m)4 


In other words, we do not calculate the vacuum energy as such, but only the 
difference between it and the vacuum energy we would have had if the particle had 
mass m’ instead of m. The arbitrarily long time T cancels out and E is proportional 
to the volume of space V, as might be expected. Thus, the (difference in) vacuum 
energy density is 


E i d*k kh? ~m? + ie 
—- | —— log | -—_.——_ (3) 
V 2 (27r)* k2 — m2 +ie 
if @k f dw w* —w, + ie 
—e log a 
2/ QxyJ Qn 


ww? +ie 


where w, = +V/k? + m’. We treat the (convergent) integral over w by integrating 
by parts: 


[2% 0 ot so tie =-2 f Mo) — ~ > 0p 
2m dw e w — wi? + ie 20 w* — we +ie k 


; 1 ! 
= ~1204( SE) _ (wy —> @,) 
= +1 (w, — @,) (4) 


Indeed, restoring i we get the result we want: 


E ak 
= -| ony he ~ zh) (5) 


We had to go through a few arithmetical steps to obtain this result, but the important 
point is that using the path integral formalism we have managed to obtain a result 
previously obtained using the canonical formalism. 
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A peculiar sign for fermions 


Our goal to is to figure out the path integral for the spinor field. Recall from Chapter 
II.2 that the vacuum energy of the spinor field comes out to have the opposite sign 
to the vacuum energy of the scalar field, a sign that surely ranks among the “top 
ten” signs of theoretical physics. How are we to get it using the path integral? 

As explained in Chapter 1.3, the origin of (1) lies in the simple Gaussian 
integration formula 


+00 
| dxe-2™ = 27 — Jie" 2 "84 
a 


-OO 


Roughly speaking, we have to find a new type of integral so that the analog of the 


— oo 1 
Gaussian integral would go something like e* 2 '°8 7. 


Grassmann math 


It turns out that the mathematics we need was invenied long ago by Grassmann. 
Let us postulate a new kind of number, called the Grassmann or anticommuting 
number, such that if 7 and € are Grassmann numbers, then 7€ = —&n. In particular, 
n* = 0. Heuristically, this mirrors the anticommutation relation satisfied by the 
spinor field. Grassmann assumed that any function of 7 can be expanded in a Taylor 
series. Since n* = 0, the most general function of 7 is f(y) =a + bn. 

How do we define integration over 7? Grassmann noted that an essential prop- 
erty of ordinary integrals is that we can shift the dummy integration variable: 
[2S dxf (x +0) = [7° dxf (x). Thus, we should also insist that the Grassmann 
integral obey the rule f d nf (n+ &) = f d nf(y), where & is an arbitrary Grass- 
mann number. Plugging into the most general function given above, we find that 
f dnb& =0. Since & is arbitrary this can only hold if we define { dnb =0 for any 
ordinary number d, and in particular f dn = f dnl=0 

Since given three Grassmann numbers x, 7, and &, we have x(n) = (n&) x, 
that is, the product (7€) commutes with any Grassmann number x, we feel that 
the product of two anticommuting numbers should be an ordinary number. Thus, 
the integral f dyn is just an ordinary number that we can simply take to be 1: 
This fixes the normalization of dy. Thus Grassmann integration is extraordinarily 
simple, being defined by two rules: 


| dn =0 © 
and 
/ dnn =1 (7) 


With these two rules we can integrate any function of 7: 
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[ante = [ anat+on =» (8) 


if b is an ordinary number so that /(7) is Grassmannian, and 


[angen = [ ana + bm =-6 (9) 


if b is Grassmannian so that f(7) is an ordinary number. Note that the concept of 
a range of integration does not exist for Grassmann integration. It is much easier 
to master Grassmann integration than ordinary integration! 

Let 7 and 7 be two independent Grassmann numbers and a an ordinary number. 
Then the Grassmannian analog of the Gaussian integral gives 


[an f aieton = f an f aia +iian) = f dnan=a=et 4 (10) 


Precisely what we had wanted! 
We can generalize immediately: Let n = (7), 72,---,y) be N Grassmann 
numbers, and similarly for 7; we then have 


[anf aier*” der A (11) 


for A = {A;;} an antisymmetric N by N matrix. (Note that contrary to the bosonic 
case, the inverse of A need not exist.) We can further generalize to a functional 
integral. 

As we will see shortly, we now have all the mathematics we need. 


Grassmann path integral 


In analogy with the generating functional for the scalar field 
z= | DoeiS@) = | Doe! { tx 3109) (n?ieng" 
we would naturally write the generating functional for the spinor field as 
z= | Dy DpeSoD = | Dy | Die { Axo Gg-msiow 


Treating the integration variables y and yw as Grassmann-valued Dirac spinors, 
we immediately obtain 


Z= | Dy | Dpel f Uxves-miow — CO! detig — m + ie) 


= Cet logig—mtie) (12) 
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where C’ is some multiplicative constant. Using the cyclic property of the trace, 
we note that (here m is understood to be m — ie) 


trlog(ig — m) = trlog y°(i9 — m)y° = tr log(—ig — m) 
= J [tr log(ig — m) + trlog(—ig — m)] 


dtr log(a? + m?). (13) 


Thus, Z = C’ eit log@?-+m?—ie) [compare with (1)!]. 

We see that we get the same vacuum energy we obtained in Chapter II.2 using 
the canonical formalism if we remember that the trace operation here contains a 
factor of 4 compared to the trace operation in (1), since (ig — m) is a4 by 4 matrix. 

Heuristically, we can now see the necessity for Grassmann variables. If we 
were to treat y and y as complex numbers in (12), we would obtain something 
like (1/det[i# — m]) = e—* 29-™) and so have the wrong sign for the vacuum 
energy. We want the determinant to come out in the numerator rather than in the 
denominator. 


Dirac propagator 


Now that we have learned that the Dirac field is to be quantized by a Grassmann 
path integral we can introduce Grassmannian spinor sources 7 and 7 : 


Z(n,h) = | Dy Dei J A xtha—my titel (14) 


and proceed pretty much as before. Completing the square just as in the case of 
the scalar field, we have 


wKw +a t+ bn = + 9K DKW +K'n) — aK 'n (15) 
and thus 
Zn, #) = Ce HEI (16) 


The propagator $(x) for the Dirac field is the inverse of the operator @¥ — m): in 
other words, S(x) is determined by 


J — m)S(x) = 8O (x) (17) 
As you can verify, the solution is 
d‘p ie 1P* 
(2x)* p—m+ie 


iS(x)= (18) 


in agreement with (II.2.22). 
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Figure 11.5.1 


Feynman rules for fermions 


We can now derive the Feynman rules for fermions in the same way that we derived 
the Feynman rules for a scalar field. For example, consider the theory of a scalar 
field interacting with a Dirac field 


L=biy"d, —m)v + 519) — n¢7]—Ag* + fovy (19) 


The generating functional 
Zn, 4, J) = | Dy De Doel VO f Cx roriveon (20) 


can be evaluated as a double series in the couplings A and f. The Feynman rules 
(not repeating the rules involving only the boson) are as follows: 


1. Draw a diagram with straight lines for the fermion and dotted lines for the 
boson, and label each line with a momentum, for example, as in Figure IT.5.1. 
2. Associate with each fermion line the propagator 


— fb; ptm (21) 

p—m+ie p*—m’+ie 

3. Associate with each interaction vertex the coupling factor if and the factor 
(27)48O(., P — Dou P) eXpressing momentum conservation (the two 
sums are taken over the incoming and the outgoing momenta, respectively). 

4. Momenta associated with internal lines are to be integrated over with the 
measure {[d* p/(27)"]. 

5. External lines are to be amputated. For an incoming fermion line write 
u(p, s) and for an outgoing fermion line u(p’, s). The sources and sinks 
have to recognize the spin polarization of the fermion being produced and 
absorbed. [For antifermions, we would have u(p, s) and u(p’, s).] 

6. A factor of (—1) is to be associated with each closed fermion line. 
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p+k p 


iey" 


Figure 1.5.2 


Note that rule 6 is unique to fermions, as we derived earlier in this chapter. The 
Feynman diagram corresponding to vacuum fluctuation has no external line. I will 
discuss these points in detail in Chapter IV.3. 

For the theory of a massive vector field interacting with a Dirac field mentioned 
in Chapter II.1 


Sw, v, A) = liv" (0, —ieA,) — ml — 4 F,, Fe — 4p7A, A" (22) 


the rules differ from above as follows. The vector boson propagator is given by 


i kyky 
poe ee) ” 


and thus each vector boson line is associated not only with a momentum, but also 
with indices jz and v. The vertex (Figure II.5.2) is associated with iey”. 


p pt+k p 


Figure I1.5.3 
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Exercises 


11.5.1. Write down the Feynman amplitude for the diagram in Fig. H.5.1 for the scalar 
theory (19). The answer is given in Chapter III.3. 


11.5.2. Applying the Feynman rules for the vector theory (22) show that the amplitude 
for the diagram in Fig. 11.5.3 is given by 


d*k 1 kk 
- \2:2 Bhp 
CO" | onie—e ( ye ~ 84») 


p+k+m 


—____y# 24 
(p +? — m2” u(p) (24) 


u(p)y” 


Chapter IT.6 


Electron Scattering and Gauge Invariance 


Electron scattering 


We will now finally calculate a physical process that experimentalists can go out 
and measure. Consider two electrons scattering off each other. To order e” the 
relevant Feynman diagrams are given in Figure II.6.1a,b, in which a photon is 
exchanged between the two electrons. 

But wait, from Chapter I.5 we only know the propagator i D,,,, = [i/ (k? — 47)] 
(k,k,/u? — nyy) for a hypothetical massive photon. (Trivial notational change: 
The mass of the photon is now called jz since m is reserved for the mass of the 
electron.) In Chapter I.5 I outlined my philosophy: We will plunge ahead and 
calculate with a nonzero yz and hope that at the end we can set jz equal to zero. 
Indeed, when we calculated the potential energy between two external charges we 
found that we can let ~ — 0 without any sign of trouble [see (1.5.6)]. In this and 
the next chapter we would like to see whether this will always be the case. 

Applying the Feynman rules, we obtain the amplitude for the diagram in Figure 
IL.6.1a (with k = P; — p, the momentum transfer in the scattering): 


A(P,, P)) = (ie. Gea - rw) i(P,)y"u(p,)a(P»)y"u(pr) 
(P, — pp)? —p2 \ 
= (ie) Go piu pda YuMPD (a) 
17 Fi _ 


We have used 
k,u(P)y"u(py) = (Py — Ppt Poy" u(py) =u(P))( Pi — BYu(p) 
= u(P))(m — m)u(p,) =0 (2) 


by virtue of the equations of motion satisfied by u(P,) and u(p,). 

This important observation implies that the k,,k,,/ y” term in the photon prop- 
agator does not enter and hence we can set the photon mass jz equal to zero with 
impunity. Thus, 


- 2 
ie - - 
A(P), P2) = Py ppt tPoy iu (pd Pr)¥uM(P2) 
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P, P, 


P; P2 


{a) 


(b) 


Figure 1.6.1 


Note that the identity that allows us to set 42 equal to zero is just the mo- 
mentum space version of electromagnetic current conservation d,,/* = 0, with 
J# = wy "w. You may notice that this calculation is intimately related to the one 
we did in going from (1.5.4) to (1.5.5), with u(P;)y“u(p;) playing the role of 
J#(k). 

We have indicated the dependence of A only on the final momenta. By Fermi 
Statistics, the amplitude for the diagram in Figure II.6.1b is then —A(P), P)). 
Thus, the amplitude for two electrons of momentum p, and p, to scatter into two 
electrons with momentum P, and P» is 
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M = A(P;, Pp) — A(Po, Py) (3) 
To obtain the cross section we have to square the amplitude 
IMI? = [|ACP), Po)? + (Py <> Pp)]—2Re A(P2, P)*ACP;, Po) (4) 


At this point we have to do a fair amount of arithmetic, but keep in mind that 
there is nothing conceptually intricate in what follows. First, the term 


|A(P;, Py)? = 
_ p\)* 


[u(P))y"u (pe (py) "u (PNA (P2)¥,.4 (po) (po) yu (Po) (5) 


factorizes with one factor involving spinors carrying momentum with subscript 
1 and another factor involving spinors carrying momentum with subscript 2. In 
contrast, the interference term A(P2, P,)*A(P;, P2) does not factorize. 

In the simplest experiments, the initial electrons are unpolarized, and the polar- 
ization of the outgoing electrons is not measured. We can then average over initial 
spins and sum over final spins, using (II.2.8): 


ptm 
2m 


Y u(p, s)i(p, 8) = (6) 


AY 


Thus, in averaging and summing |A(P;, P>)|? we encounter the object (displaying 
the spin labels explicitly) 


(PL, Pi) => > a(Py, S)y“u(p,, s)a(P1, 8)v"U(P, S) 7) 
s § 


1 
a Pit m)y*( pi +m)y" 
~ (2m) 
which is to be multiplied by 1,,,(P2, p2). Similarly, in averaging and summing 
A(P3, P;)*A(P;, P2) we encounter the more involved object 


x= dD 
iP )y“u(paPa)ryu(da(P)Y'UCPaP»Y MCP) —-B) 


where for simplicity of notation we have suppressed the spin labels. Applying (6) 
we can also write x as a trace. . 

Well, we, or rather you, have to develop some technology for evaluating the 
trace of products of gamma matrices. The key observation is that the square of 
a gamma matrix is either +1 or —1 and different gamma matrices anticommute. 
Clearly, the trace of a product of an odd number of gamma matrices vanishes. Thus, 


t!’(Py, py) = ——[u( Pyy" py”) +m? try“) 


(2m 7 
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Writing 
t( Pay” By”) = Pippa y?y*y*y”) 


and noting that the trace of a product of an even number of gamma matrices is 
shown in Appendix D, we obtain 


t!’(P), Pi) = 5 (PP) — Py Py + Py py + m?yH’) (9) 


4 
(2m)? 

Furthermore, since there are only four different gamma matrices, the trace of 
a product of six gamma matrices can always be reduced to the trace of a product 
of four gamma matrices, since there are always pairs of gamma matrices that are 
equal and can be brought together by anticommuting. Similarly for the trace of a 
product of even higher numbers of gamma matrices. The evaluation of x is quite 
tedious, since it involves traces of products of up to eight gamma matrices. We 
will be content to obtain x in the relativistic limit in which m may be neglected 
compared to the momenta: 


= Om or, tC Piv” Biv” Povy BaYv) (10) 


Applying the identities in Appendix D to (10) we obtain 
tC Py” piy” Poy, Boy) =—2 0 Pry” By Ba¥y Po) = —32P)- P2Pi- Po 


In the same limit 


te°(P), Pp) = oe y (Pip; + Py py — "Py: Py) 
and thus 
t#*(P1, Pi)Tuy(Po, Pr) 
= 1° (pip + PY pit — nl” P,- p)(2PS pS — nf!" Pa Pr) 
(my! 1° Py » P> FQ: P2 
_ 16- 


= Om et P2P,- P+ Py: Popa: Pi) 


While it is not among the purposes of this book to teach you relativistic 
kinematics and to calculate cross sections it is character building to occasion- 
ally push calculations to the bitter end. In the center-of-mass frame in the rela- 
tivistic limit p,; = EC, 0,0, 1), pp = EC, 0, 0, —1), P; = E(1, sin@, 0, cos @), 
and P, = E(1, — sin @, 0, — cos @). Hence, p,- po = P, - P, =2E’, py Py = 
p2- Px =2E? sin"(6/2), and p,- P) = Pa. P, = 2E” cos*(@/2). Also, in this 
limit (P, — p,)* = (—2p; - P,)? = 16E* sin*(@/2). Putting it together, we obtain 
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1X, Ys IM? = (e4/4m4) £ (0), where 


_ 1+cos*@/2) 2 i+ sin*(6/2) 


—_——______. 1 
sin* (6/2) sin’(@/2) cos?(6/2) cos(6/2) qh) 


f@) 


The physical origin of each of these terms is clear. The first term strongly favors 
forward scattering owing to the photon propagator ~ 1/k blowing up at k ~ 0. The 
third term is required by the indistinguishability of the two outgoing electrons: The 
Scattering must be symmetric under 6 + a — @ since experimentalists can’t tell 
whether a particular incoming electron has scattered forward or backward. The 
second term is the most interesting of all: It comes from quantum interference. 
If we had mistakenly thought that electrons are bosons and taken the plus sign 
in (3), the second term in f(@) would come with a minus sign. This makes a big 
difference: For instance f (2/2) would be 5 — 8 + 5 =2 instead of 5+ 8+5= 18. 

Since the conversion of a squared probability amplitude to a cross section is 
conceptually the same as in nonrelativistic quantum mechanics (divide by the 
incoming flux, etc.) I will relegate the appropriate formula to Appendix C and 
let you go the last few steps and obtain the differential cross section as an exercise: 


do e- 2 ] 
“-(<) apo!) (12) 


An amazing business 
When you think about it, theoretical physics is truly an amazing business. After 
the appropriate equipment is assembled and high energy electrons are scattered off 
each other, experimentalists indeed would find the differential cross section given 
in (12). There is something almost magical about it. 
Exercises 


1.6.1. Verify all the statements made in the text. 


11.6.2. Compute the cross section given in (12). 


Chapter II.7 


Diagrammatic Proof of Gauge Invariance 


Gauge invariance 


Conceptually, rather than calculate cross sections, we have the more important task 
of proving that we can indeed set the photon mass yz equal to zero with impunity 
in calculating any physical process. With z = 0, the Lagrangian given in Chapter 
II.1 becomes the Lagrangian for quantum electrodynamics: 


L=pliy"(@, —ieA,) — my — 7 F,)FY (1) 


We are now ready for one of the most important observations in the history of 
theoretical physics. Behold, the Lagrangian is left invariant by the gauge transfor- 
mation 


W(x) > EAM W(x) (2) 
and 
Ay(x) > Ay (x) + Fen IAgy git) = A, (x) + =A (2) (3) 
which implies 
Fyy(x) > Fy) (4) 


You are of course already familiar with (3) and the invariance of F.,,,, from classical 
electromagnetism. 

In contemporary theoretical physics, gauge invariance’ is regarded as funda- 
mental and all important, as we will see later. The modern philosophy is to look 
at (1) as a consequence of (2) and (3). If we want to construct a gauge invariant 
relativistic field theory involving a spin-; and a spin-1 field, then we are forced to 
quantum electrodynamics. 


1 


! The discovery of gauge invariance was one of the most arduous in the history of physics. 
Read J. D. Jackson and L. B. Okun, “Historical roots of gauge invariance,” Rev. Mod. Phys. 
73, 2001 and learn about the sad story of a great physicist whose misfortune in life was that 
his name differed from that of another physicist by only one letter. 
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You will notice that in (3) I have carefully given two equivalent forms. While 
it is simpler, and commonly done in most textbooks, to write the second form, 
we should also keep the first form in mind. Note that A(x) and A(x) + 27 give 
exactly the same transformation. Mathematically speaking, the quantities e'4® 
and d,,A(x) are well defined, but A(x) is not. 

After these apparently formal but actually physically important remarks, we are 
ready to work on the proof. I will let you give the general proof, but I will show 
you the way by working through some representative examples. 

Recall that the propagator for the hypothetical massive photon is iD,, = 
i(kyky/ - Suv) (kK? — 7). We can set the yu? in the denominator equal to 
zero without further ado and write the photon propagator effectively as 1D, = 
i(k, k,/u? — g,y)/k°. The dangerous term is k,,k,/”. We want to show that it 
goes away. 


A specific example 


First consider electron-electron scattering to order e*. Of the many diagrams, focus 
on the two in Figure II.7.1.a. The Feynman amplitude is then 


—e a 1 I a a Kk, _ ) 
u(p) (v pa kom! +’ Wom kom’ ) rs ( | BMY Dy 
(5) 


where I,,, is some factor whose detailed structure does not concern us. For the 
specific case shown in Figure II.7.1a we can of course write out [',,, explicitly if 
we want. Note the plus sign here from interchanging the two photons since photons 
obey Bose statistics. 

Focus on the dangerous term. Contracting the #(p’)(- - -)u(p) factor in (5) with 
k,, we have 


-~,;_t a 1 1 x 
u(y stom Bt Gyan) (6) 
The trick is to write the X in the numerator of the first term as (p+ & — m) — 
(p — m), and in the numerator of the second term as (p’ — m) — (p’— k —m). 
Using (p — m)u(p) = 0 and a (p’)(p’ — m) = 0, we see that the expression in (6) 
vanishes. This proves the theorem in this simple example. But since the explicit 
form of I’, ,, did not enter, the proof would have gone through even if Figure II.7.1a 
were replaced by the more general Figure II.7.1b, where arbitrarily complicated 
processes could be going on under the shaded blob. 

Indeed we can generalize to Figure I1.7.1c. Apart from the photon carrying 
momentum k that we are focusing on, there are already n photons attached to the 
electron line. These n photons are just “spectators” in the proof in the same way 
that the photon carrying momentum k’ in Figure II.7.1a never came into the proof 
that (6) vanishes. The photon we are focusing on can attach to the electron line in 
n + 1 different places. You can now extend the proof as an exercise. 


II.7. Diagrammatic Proof of Gauge Invariance 137 


{a) 


(b) 


Figure II.7.1 


Photon landing on an internal line 


In the example we just considered, the photon line in question lands on an external 
electron line. The fact that the line is “capped at the two ends” by u(p’) and u(p) is 
crucial in the proof. What if the photon line in question lands on an internal line? 

An example is shown in Figure II.7.2, contributing to electron-electron scatter- 
ing in order e®. The figure contains three distinct diagrams. The electron “on the 
left” emits three photons, which attach to an internal electron loop. The electron 
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(c) 


Figure II.7.1 (continued) 


“on the right” emits a photon with momentum k, which can attach to the loop in 
three distinct ways. 

Since what we care about is whether the kk, / u piece in the photon propagator 
E(k, Kp/ we Sup)! k* goes away or not, we can for our purposes replace that 
photon propagator by k,,. To save writing slightly, we define p; = p + q, and 
P2 = Pp, +4 (see the momentum labels in Figure II.7.2): Let’s focus on the 
relevant part of the three diagrams, referring to them as A, B, and C. 


d‘p ( 1 1 A 1 I ) 
A= poo A 7 
ou Yi kam’ Fakom’ preom*p—m) 


d‘p (v 1 1 i , 1 ) 
B= tr yy? — 8 
ny "\” oa kom’ pitkom p,m’ pom ®) 


and 


d‘p ( 1 1 1 , 1 ) 
C= tr | y”» ———_— ° 9 
| oe ” byt kom’ f,—m? pi—m" p-—m ” 


This looks like an unholy mess, but it really isn’t. We use the same trick we 
used before. In C write = ( po+ K — m) — (B2 — m), so that 


_ f a‘p yl ol a1 
c= | a fe (y py —m" pi-m* ia) 


vy 1 o 1 ra I )] 
(» Po+ k—m* pi-m* p-—m ©) 
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os . 


Figure II.7,2 
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In B write k = ( p,+ k-—m) — ( p,—m), so that 


| 1 i , 1 
B= tr vo —“*"="ewe 
| ow % Bot k—m" pi—m" b—m 


1 ; ol . 1 )] 
—tr ———_ y* ———— 11 
(" byt km! pit k—m’ p—m ” 


Finally, in A write k = (p+ k—m)—-—(p—m) 


_ f ap |e ( y 1 cs 1 , 1 ) 
SL Ort pp kom” ppt k—m" pom 


_ yo 7 —)] 12 
«(> Bot k—m" pt k—m" pt kom “) 


Now you see what is happening. When we add the three diagrams together 
terms cancel in pairs, leaving us with 


je (»° 1 yo 1 y —) 
Bo-m pi-m p—m 


y 1 o 1 A 1 )] 
—tr ey Fy A 13 
(y bot kom’ prk-m’ prgom)|°” 


4 
AtB+c= | ap 
x4 


(2. 


If we shift (see Exercise II.7.2) the dummy integration variable p — p — k in the 
second term, we see that the two terms cancel. Indeed, the k,,k,/ ye” piece in the 
photon propagator goes away and we can set uz = 0. 

I will leave the general proof to you. We have done it for one particular process. 
Try it for some other process. You will see how it goes. 


Ward-Takahashi identity 


Let’s summarize. Given any physical amplitude T“""*(k, - - -) with external elec- 
trons on shell [this is jargon for saying that all necessary factors u(p) and u(p) are 
included in T#*"*(k, - - -)] describing a process with a photon carrying momentum 
k coming out of, or going into, a vertex labeled by the Lorentz index 4, we have 


ky TY (k, ++) =0 (14) 
This is sometimes known as a Ward-Takahashi identity. 


The bottom line is that we can write i D,,, = —ig"” /k* for the photon prop- 
agator. Since we can discard the k,,k,,/ yu” term in the photon propagator 
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i(kyky/ — 8yuyv)/ k* we can also add in a ky ky /k? term with an arbitrary co- 
efficient. Thus, for the photon propagator we can use 

ky 
ke 


Dw = a= - b| (15) 
where we can choose the number & to simplify our calculation as much as possible. 
Evidently, the choice of € amounts to a choice of gauge for the electromagnetic 
field. In particular, the choice € = 1 is known as the Feynman gauge, and the choice 
&€ = 0 is known as the Landau gauge. If you find an especially nice choice, you can 
have a gauge named after you as well! For fairly simple calculations, it is often 
advisable to calculate with an arbitrary &. The fact that the end result must not 
depend on € provides a useful check on the arithmetic. 

This completes the derivation of the Feynman rules for quantum electrodynam- 
ics: They are the same rules as those given in Chapter II.5 for the massive vector 
boson theory except for the photon propagator given in (15). 

We have given here a diagrammatic proof of the gauge invariance of quantum 
electrodynamics. We will worry later (in Chapter IV.7) about the possibility that 
the shift of integration momentum used in the proof may not be allowed in some 
cases. 


The longitudinal mode 


We now come back to the worry we had in Chapter I.5. Consider a massive spin 
1 meson moving along the z—direction. The 3 polarization vectors are fixed by the 
condition kre, = 0 with k* = (a, 0, 0, k) (recall chapter I.5) and the normalization 
e*e, = —1, so that e{” = (0, 1, 0, 0), & = (0, 0, 1,0), € = (—k, 0, 0, w)/u. 
Note that as 2 — 0, the longitudinal polarization vector e) becomes proportional 
to k, = (w, 0, 0, —k). The amplitude for emitting a meson with a longitudinal 


polarization in the process described by (14) is given by e) TR = (-kTO + 


wT?) p= (RTO + EF ETP pe & (HATO + e+ ETP) (For 
yu <k), namely —k, TT? /u + fT? with k, = (k, 0, 0, —k). Upon using (14) 
we see that the amplitude To + O0asu—> 0. 

The longitudinal mode of the photon does not exist because it decouples from 
all physical processes. 

Here is an apparent paradox. Mr. Boltzmann tells us that in thermal equilibrium 
each degree of freedom is associated with }7. Thus, by measuring some thermal 
property (such as the specific heat) of a box of photon gas to an accuracy of 2/3 an 
experimentalist could tell if the photon is truly massless rather than have a mass 
of a zillionth of an electron volt. 

The resolution is of course that as the coupling of the longitudinal mode vanishes 
as 4 —> Othe time it takes for the longitudinal mode to come to thermal equilibrium 
goes to infinity. Our crafty experimentalist would have to be very patient. 
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Exercises 


Extend the proof to cover Figure II.7.1c. [Hint: To get oriented, note that Figure 
Il.7.1b corresponds to n = 1.] 


You might have worried whether the shift of integration of variable is allowed. 
Rationalizing the denominators in the first integral 


d‘p y 1 o 1 x, I 
tr 
| oe Y py —m" Bi-m" p-m 


in (13) and imagining domg the trace, you can convince yourself that this integral 
is only logarithmically divergent and hence that the shift is allowed. This issue 
will come up again in Chapter (IV.7). 


PART Il 


RENORMALIZATION 
AND 
GAUGE INVARIANCE 


Chapter II. 1 
Cutting Off Our Ignorance 


Who is afraid of infinities? Not I, I just cut them off. 
—Anonymous 


An apparent sleight of hand 


The pioneers of quantum field theory were enormously puzzled by the divergent 
integrals that they often encountered in their calculations, and they spent much of 
the 1930s and 1940s struggling with these infinities. Many leading lights of the 
day, driven to desperation, advocated abandoning quantum field theory altogether. 
Eventually, a so-called renormalization procedure was developed whereby the 
infinities were argued away and finite physical results were obtained. But for many 
years, well into the late 1960s and even the 1970s many physicists looked upon 
renormalization theory suspiciously as a sleight of hand. Jokes circulated that in 
quantum field theory infinity is equal to zero and that under the rug in a field 
theorist’s office had been swept many infinities. 

Eventually, starting in the 1970s a better understanding of quantum field theory 
was developed through the efforts of Ken Wilson and many others. Field theorists 
gradually came to realize that there is no problem of divergences in quantum field 
theory at all. We now understand quantum field theory as an effective low energy 
theory in a sense I will explain briefly here and in more detail in Chapter VIIL3. 


Field theory blowing up 


We have to see an infinity before we can talk about how to deal with infinities. 
Well, we saw one in Chapter I.7. Recall that the order 47 correction (1.7.23) to the 
meson-meson scattering amplitude diverges. With K =k, + k2, we have 


d*k 1 1 


a 1 
(21)4 k2 — m2 + ie (K —k)? — m2 + ie 1) 


M =3(—iA)*i? 


As I remarked back in Chapter I.7, even without doing any calculations we 
can see the problem that confounded the pioneers of quantum field theory. The 
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integrand goes as 1/k* for large k and thus the integral diverges logarithmically as 
f{d*k/(21)*\(1/k*). (The ordinary integral {°° dr r” diverges linearly for n = 0, 
quadratically for n = 1, and so on, and { °° ar diverges logarithmically.) Since 
this divergence is associated with large values of k it is known as an ultraviolet 
divergence. 

To see how to deal with this apparent infinity, we have to distinguish between 
two conceptually separate issues, associated with the terrible names “regulariza- 
tion” and “renormalization” for historical reasons. 


Parametrization of ignorance 


Suppose we are studying quantum electrodynamics instead of this artificial g* 
theory. It would be utterly unreasonable to insist that the theory of an electron 
interacting with a photon would hold to an arbitrarily high energy. At the very 
least, with increasingly higher energies other particles come in, and eventually 
electrodynamics becomes merely part of a larger electroweak theory. Indeed, these 
days it is thought that as we go to higher and higher energies the whole edifice of 
quantum field theory will ultimately turn out to be an approximation to a theory 
whose identity we don’t yet know, but probably a string theory according to some 
physicists. 

The modern view is that quantum field theory should be regarded as an effective 
low energy theory, valid up to some energy (or momentum in a Lorentz invariant 
theory) scale A. We can imagine living in a universe described by our toy ¢* theory. 
As physicists in this universe explore physics to higher and higher momentum 
scales they will eventually discover that their universe is a mattress constructed 
out of mass points and springs. The scale A is roughly the inverse of the lattice 
spacing. 

When I teach quantum field theory, I like to write “Ignorance is no shame” 
on the blackboard for emphasis when I get to this point. Every physical theory 
should have a domain of validity beyond which we are ignorant of the physics. 
Indeed were this not true physics would not have been able to progress. It is a good 
thing that Feynman, Schwinger, Tomonaga, and others who developed quantum 
electrodynamics did not have to know about the charm quark for example. 

Temphasize that A should be thought of as physical, parametrizing our threshold 
of ignorance, and not as a mathematical construct.! Indeed, physically sensible 
quantum field theories should all come with an implicit A. If anyone tries to sell 
you a field theory claiming that it holds up to arbitrarily high energies, you should 


! We saw a particularly vivid example of this in Chapter 1.8. When we define a conducting 
plate as a surface on which a tangential electric field vanishes, we are ignorant of the 
physics of the electrons rushing about to counter any such imposed field. At extremely 
high frequencies, the electrons can’t rush about fast enough and new physics comes in, 
namely that high frequency modes do not see the plates. In calculating the Casimir force 
we parametrize our ignorance with a ~ A7!. 
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check to see if he sold used cars for a living. (As I wrote this, a colleague who is 
an editor of Physical Review Letters told me that he worked as a garbage collector 
during high school vacations, adding jokingly that this experience prepared him 
well for his present position.) 

Thus, in evaluating f d*k /(22)* in (1) we should integrate only up to A, known 
as a cutoff. We literally cut off the momentum integration (Figure Iff.1.1).2 The 
integral is said to have been “regularized.” 

Since my philosophy in this book is to emphasize the conceptual rather than the 
computational, I will not actually do the integral but merely note that it is equal to 
2iC log(A?/K7) where C is some numerical constant that you can compute if you 
want (see Appendix 1 to this chapter). For the sake of simplicity I also assume that 
m? << K* so that we can neglect m? in the integrand. It is convenient to define 
the kinematic variables s = K* = (ki + ko)’, t = (k, — k3)*, and u = (k; — ky)’. 
(Writing out the k ;’s explicitly in the center-of-mass frame, you see that s, f, and 
u are related to rather mundane quantities such as the center-of-mass energy and 
the scattering angle.) After all this, the meson-meson scattering amplitude reads 


2 2 2 
M =—id +iCA7[log (~) + log (~) + log (“~)s +007) (2) 


2 A. Zee, Einstein’s Universe, p. 204. Cartooning schools apparently teach that physicists 
in general, and quantum field theorists in particular, all wear lab coats. 
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This much is easy enough to understand. After regularization, we speak of 
cutoff-dependent quantities instead of divergent quantities, and M depends log- 
arithmically on the cutoff. 


What is actually measured 


Now that we have dealt with regularization, let us turn to renormalization, a terrible 
word because it somehow implies we are doing normalization again when in fact 
we haven’t yet. 

The key here is to imagine what we would tell an experimentalist about to 
measure Meson-meson scattering. We tell her (or him if you insist) that we need a 
cutoff A and she is not bothered at all; to an experimentalist it makes perfect sense 
that any given theory has a finite domain of validity. 

Our calculation is supposed to tell her how the scattering will depend on the 
center-of-mass energy and the scattering angle. So we show her the expression in 
(2). She points to A and exclaims, “What in the world is that?” 

We answer, “The coupling constant,” but she says, “What do you mean, cou- 
pling constant, it’s just a Greek letter!” 

A confused student, Confusio, who has been listening in, pipes up, “Why the 
fuss? I have been studying physics for years and years, and the teachers have 
shown us lots of equations with Latin and Greek letters, for example, Hooke’s 
law F = --kx, and nobody gets upset about k being just a Latin letter.” 

Smart Experimentalist: “But that is because if you give me a spring I can go 
out and measure k. That’s the whole point! Mr. Egghead Theorist here has to tell 
me how to measure this 2.” 

Woah, that is a darn smart experimentalist. We now have to think more carefully 
what a coupling constant really means. Think about a, the coupling constant of 
quantum electrodynamics. Well, it is the coefficient of 1/r in Coulomb’s law. Fine, 
Monsieur Coulomb measured it using metallic balls or something. But a modern 
experimentalist could just as well have measured & by scattering an electron at 
such and such an energy and at such and such a scattering angle off a proton. We 
explain all this to our experimentalist friend. 

SE, nodding, agrees: “Oh yes, recently my colleague so and so measured the 
coupling for meson-meson interaction by scattering one meson off another at such 
and such an energy and at such and such a scattering angle, which correspond to 
your variables s, ¢, and u having values so, to, and uy. But what does the coupling 
constant my colleague measured, let us call it Ap, with the subscript meaning 
“physical,” have to do with your theoretical 2, which, as far as I am concerned, is 
just a Greek letter in something you call a Lagrangian!” 

Confusio, “Hey, if she’s going to worry about small lambda, I am going to worry 
about big lambda. How do I know how big the domain of validity is?” 

SE: “Confusio, you are not as dumb as you look! Mr. Egghead Theorist, if I 
use your formula (2), what is the precise value of A that I am supposed to plug in? 
Does it depend on your mood, Mr. Theorist? If you wake up feeling optimistic, do 
you use 2A instead of A? And if your girl friend left you, you use 4A?” 
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We assert, “Ha, we know the answer to that one. Look at (2): M is supposed to 
be an actual scattering amplitude and should not depend on A. If someone wants to 
change A we just shift A in such a way so that M does not change. In fact, a couple 
lines of arithmetic will show you precisely what di/dA has to be (see Exercise 
IIl.1.3).” 

SE: “Okay, so A is secretly a function of A. Your notation is lousy.” 

We admit, “Exactly, this bad notation has confused generations of physicists.” 

SE: “I am still waiting to hear how the A p my experimental colleague measured 
is related to your 4.” 

We say, “Aha, that’s easy. Just look at (2), which is repeated here for clarity and 
for your reading convenience: 


2 2 2 
M=—iaA +iCr? EE (~) + log (~) + log (~)| + O(a?) (3) 


According to our theory, Ap is given by 


2 2 2 
—iXkp=—iA+ iCr og (~) + log (*) + log (+) + O(a) (4) 
So to Ug 


To show you clearly what is involved, let us denote the sum of logarithms in the 
square bracket in (3) and in (4) by L and by Lo, respectively, so that we can write 
(3) and (4) more compactly as 


M=—iA +ica*L + O(a) (5) 
and 
~iAp =—id +iCA7Ly + O(A°) (6) 


That is how Ap and A are related.” 

SE: “If you give me the scattering amplitude expressed in terms of the physical 
coupling A p then it’s of use to me, but it’s not of use in terms of 4. I understand 
what A p is, but not A.” 

We answer: “Fine, it just takes two lines of algebra to eliminate A in favor of 
Ap. Big deal. 

Solving (6) for 4 gives 


ik =—idp —iCM*Ly + O(A3) = —iAp —iCAZL9 + O04) 7) 


The second equality is allowed to the order of approximation indicated. Now plug 
this into (5) 


M = —iA +iCa?L + O(3) =—iAp —iCA%Ly +iCAZL + O(Ap) (8) 


Please check that all manipulations are legitimate up to the order of approximation 
indicated.” 


150 lil, Renormalization and Gauge Invariance 


The “miracle” 


Lo and behold! The miracle of renormalization! 

Now in the scattering amplitude M we have the combination L — Ly = 
[log(sg/s) + log(to/t) + log(ug/u)]. In other words, the scattering amplitude 
comes out as 


M =—-iAp + iC, oe (2) + log (*) + log (*)| + O(A3) (9) 
5 u 


We announce triumphantly to our experimentalist friend that when the scatter- 
ing amplitude is expressed in terms of the physical coupling constant A p as she 
had wanted, the cutoff A disappears completely! 


The answer should always be in terms of 
physically measurable quantities 


The lesson here is that we should express physical quantities not in terms of 
“fictitious” theoretical quantities such as A, but in terms of physically measurable 
quantities such as Ap. 

By the way, in the literature, Ap is often denoted by Ap and for historical 
reasons called the “renormalized coupling constant.” I think that the physics of 
“renormalization” is much clearer with the alternative term “physical coupling 
constant,” hence the subscript P. We never did have a “normalized coupling 
constant.” 

Suddenly Confusio pipes up again; we have almost forgotten him! 

Confusio: “You started out with an M in (2) with two unphysical quantities 4 
and A, and their “unphysicalness” sort of cancel each other out.” 

SE: “Yeah, it is reminiscent of what distinguishes the good theorists from the 
bad ones. The good ones always make an even number of sign errors, and the bad 
ones always make an odd number.” 


Integrating over only the slow modes 


In the path integral formulation, the scattering amplitude M discussed here is 
obtained by evaluating the integral (Chapter 1.7) 


[> (xo (xs) 0(x3)0 (age FA (Z1G9)? mg" 0) 


The regularization used here corresponds roughly to restricting ourselves, in the 
integral f Dg, to integrating over only those field configurations g(x) whose 
Fourier transform ¢(k) vanishes for k = A. In other words, the fields corresponding 
to the internal lines in the Feynman diagrams in Fig (1.7.10) are not allowed to 
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fluctuate too energetically. We will come back to this path integral formulation 
later when we discuss the renormalization group. 


Alternative lifestyles 


I might also mention that there are a number of alternative ways of regularizing 
Feynman diagrams, each with advantages and disadvantages that make them 
suitable for some calculations but not others. The regularization used here, known 
as Pauli-Villars, has the advantage of being physically transparent. Another often 
used regularization is known as dimensional regularization. We pretend that we 
are calculating in d-dimensional spacetime. After the Feynman integral has been 
beaten down to a suitable form, we do an analytic continuation in d and set 
d =4 at the end of the day. The cutoff dependences of various integrals now 
show up as poles as we let d -» 4. Just as the cutoff A disappears when the 
scattering amplitude is expressed in terms of the physical coupling constant Ap, 
in dimensional regularization the scattering amplitude expressed in terms of Ap 
is free of poles. While dimensional regularization proves to be useful in certain 
contexts as I will note in a later chapter, itis considerably more abstract and formal 
than Pauli-Villars regularization. Each to his or her own taste when it comes to 
regularizing. 

Since the emphasis in this book is on the conceptual rather than the com- 
putational, I won’t discuss other regularization schemes but will merely sketch 
how Pauli-Villars and dimensional regularizations work in two appendices to this 
chapter. 


Appendix 1: Pauli-Villars regularization 


The important message of this chapter is the conceptual point that when physical amplitudes 
are expressed in term of physical coupling constants the cutoff dependence disappears. The 
actual calculation of the Feynman integral is unimportant. But I will show you how to do 
the integral just in case you would like to do Feynman integrals for a living. 

Let us start with the convergent integral 


4 _ 
(2m)4 (k2 —c*2 +ie)3 = 32m2c?2 
The dependence on c* follows from dimensional analysis. The overall factor is calculated 
in Appendix D. 
Applying the identity (D.15) 


1 1 


1 
— = da —_—_—_—__—_ 11 
xy Tax + (1—a)yP an 


152 li. Renormalization and Gauge Invariance 


to (1) we have 


4 1 
maine? f ak [ aus 
2 (2x) Jo D 


with 
D=a(K —k)? + (1—a@)k* — mm +i =(k —aK)* +a(1 —a@)K? — mm? + ie. 


Shift the integration variable k > k + aK and we meet the integral [[d*k/(27)*] 
[1/(k? — c? + ie)*], where c? = m? — a(1 — a)k?. Pauli-Villars proposed replacing it by 


ak Feseeenr _ raerend (12) 
(27)4 L(k2—c? +ie)? (kh? — A? +i)? 


For & much smaller than A the added second term in the integrand is of order A~* and is 
negligible compared to the first term since by prior agreement A is much larger than c. For 
k much larger than A, the two terms almost cancel and the integrand vanishes rapidly with 
increasing k, effectively cutting off the integral. 

Upon differentiating (12) with respect to c? and using (10) we deduce that (12) must be 
equal to (i /1672) log(A?/c). Thus, the integral 


A 4 . 2 
Ce or (4) (13) 
(27)4 (k2 —c2 +58)? 1672 c2 


is indeed logarithmically dependent on the cutoff, as anticipated in the text. 
For what it is worth, we obtain 


ot | 7 (sa=aqsaReaa) 
M=—-— | da log ( ———*__—_ 14 
32n2 Jo 108 m2 —a(1—a)K2 —ie (14) 


Appendix 2: Dimensional regularization 


The basic idea behind dimensional regularization is very simple. When we reach J = 
f[d*k/(2x)*I[1/(k? — c* + ie)] we rotate to Euclidean space and generalize to d dimen- 
sions (see Appendix D): 


1d) =i dk 1 _; at /2 1 i ml 
SS (n)4 2 +022 LT d/2)] 27)4 Jo (k2 + c2)? 


As I said, I don’t want to get bogged down in computation in this book, but we’ve got to do 
what we’ve got to do. Changing the integration variable by setting k? + c* = c*/x we find 


co 1] 1 
dk za! _ 1,d—4 [ dx(1—-x d/2—lyl~d/2 
f rape 2 fy OO”) 


which we are supposed to recognize as the integral representation of the beta function. After 
the dust settles, we obtain 
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dtk 1 _; 1 r (54) (15) 
(21 )4 (k2 +. ¢2)2 (44/2 2 


As d — 4, the nght-hand side becomes 


1.2 5 


where y = 0.577 - - - denotes the Euler-Mascheroni constant. 

Comparing with (13) we see that log A? in Pauli-Villars regularization has been effec- 
tively replaced by the pole 2/(4 — d). As noted in the text, when physical quantities are 
expressed in terms of physical coupling constants, all such poles cancel. 


Exercises 
III.1.1. Work through the manipulations leading to (9) without referring to the text. 


1.1.2. Regard (1) as an analytic function of K*. Show that it has a cut extending from 
4m? to infinity. (Hint: If you can’t extract this result directly from (1) look at (14).] 


[1L.1.3. Change A to e*® A. Show that for M not to change, to the order indicated 4 must 
change by 6A = 6eCA? + O(a3), that is, 


dad 2 3 
A— =6C)° + O(a 16 
ZA A") (16) 


Chapter II.2 


Renormalizable versus 
Nonrenormalizable 


Old view versus new view 


We learned that if we were to write the meson meson scattering amplitude in terms 
of a physically measured coupling constant Ap, the dependence on the cutoff A 
would disappear (at least to order n3). Were we lucky or what? 

Well, it turns out that there are quantum field theories in which this would hap- 
pen and that there are quantum field theories in which this would nothappen, which 
gives us a binary classification of quantum field theories. Again, for historical rea- 
sons, the former are known as “renormalizable theories” and are considered “nice.” 
The latter are known as “nonrenormalizable theories,” evoking fear and loathing 
in theoretical physicists. 

Actually, with the new view of field theories as effective low energy theories 
to some underlying theory, physicists now look upon nonrenormalizable theories 
in a much more sympathetic light than a generation ago. I hope to make all these 
remarks Clear in this and a later chapter. 


High school dimensional analysis 


Let us begin with some high school dimensional analysis. In natural units in 
which f# = 1 and c = 1, length and time have the same dimension, the inverse 
of the dimension of mass (and of energy and momentum). Particle physicists 
tend to count dimension in terms of mass as they are used to thinking of energy 
scales, Condensed matter physicists, on the other hand, usually speak of length 
scales. Thus, a given field operator has (equal and) opposite dimensions in particle 
physics and in condensed matter physics. We will use the convention of the particle 
physicists. 

Since the action S = f{ d*x£ appears in the path integral as e'®, it is clearly 
dimensionless, thus implying that the Lagrangian (Lagrangian density, strictly 
speaking) £ has the same dimension as the 4th power of a mass. We will use 
the notation [2] = 4 to indicate that £ has dimension 4. In this notation [x] = —1 
and [4] = 1. Consider the scalar field theory & = 1[(8g)? — m?y7] — Ag’. For 
the term (dg)* to have dimension 4, we see that [y] = 1 (since 2(1 + [y]) = 4). 
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This then implies that [A] = 0, that is, the coupling 4 is dimensionless. The rule is 
simply that for each term in £, the dimensions of the various pieces, including the 
coupling constant and mass, have to add up to 4 (thus, e.g., [A] + 4[g] = 4). 

How about the fermion field y? Applying this rule to the Lagrangian & = 
wiy"d,w +--+ we see that [y] = 3. (Henceforth we will suppress the - - - ; it is 
understood that we are looking at a piece of the Lagrangian. Furthermore, since we 
are doing dimensional analysis we will often suppress various uelevant factors, 
such as numerical factors and the gamma matrices in the Fermi interaction that we 
will come to presently.) Looking at the coupling f pw we see that the Yukawa 
coupling f is dimensionless. In contrast, in the theory of the weak interaction 
with £ = Gwwwy we see that the Fermi coupling G has dimension —2 (since 
—2+4(3) =4; got that’). 

From the Maxwell Lagrangian —jF',,, 4” we see that [A,,]= 1 and hence A,, 
has the same dimension as 0,,: The vector field has the same dimension as the scalar 
field. The electromagnetic coupling eA wre tells us that ¢ is dimensionless, 
which we can also deduce from Coulomb’s law written in natural units V(r) =a/r, 
with the fine structure constant a = e*/4z. 


Scattering amplitude blows up 


We are now ready for a heuristic argument regarding the nonrenormalizability of 
a theory. Consider Fermi’s theory of the weak interaction. Imagine calculating 
the amplitude M for a four-fermion interaction, say neutrino-neutrino scattering 
at an energy much smaller than A. In lowest order, M ~ G. Let us try to write 
down the amplitude to the next order: M ~ G + G*(?), where we will try to guess 
what (7?) is. Since all masses and energies are by definition small compared to the 
cutoff A, we can simply set them equal to zero. Since [G] = —2, by high school 
dimensional analysis the unknown factor (7) must have dimension +2. The only 
possibility for (?) is A?. Hence, the amplitude to the next order must have the form 
M ~ G + G*A2. We can also check this conclusion by looking at the Feynman 
diagram in Figure III.2.1: Indeed it goes as G2 [* d*p (1/p)(1/p) ~ G°A?. 

Without a cutoff on the theory, or equivalently with A = 00, theorists realized 
that the theory was sick: Infinity was the predicted value for a physical quantity. 
Fermi’s weak interaction theory was said to be nonrenormalizable. 

In desperation, some theorists advocated abandoning quantum field theory 
altogether. Others expended an enormous amount of effort trying to “cure” weak 
interaction theory. For instance, one approach was to speculate that the series 
M ~ G[1+ GA? + (GA)? + (GA*)* + ---]= Gf(GA?), where the unknown 
function f might have the property that f (00) is finite. In hindsight, we now know 
that this is not a fruitful approach. 

Instead, what happened was that toward the late 1960s S. Glashow, A. Salam, 
and S. Weinberg, building on the efforts of many others, succeeded in constructing 
an electroweak theory unifying the electromagnetic and weak interactions, as I 
will discuss in Chapter VII.2. Fermi’s weak interaction theory emerges within 
electroweak theory as a low energy effective theory. 
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Figure III.2.1 


Fermi’s theory cried out 


In modern terms, we think of the cutoff A as really being there and we hear the 
cutoff dependence of the four-fermion interaction amplitude M ~ G + G7A? as 
the sound of the theory crying out that something dramatic has to happen at the 
energy scale A ~ (1/G)2. The second term in the perturbation series becomes 
comparable to the first, so at the very least perturbation theory fails. 

Here is another way of making the same point. Suppose that we don’t know 
anything about cutoff and all that. With G having mass dimension —2, just by high 
school dimensional analysis we see that the neutrino-neutrino scattering amplitude 
at center-of-mass energy E has to goas M ~ G + G?E? + ---. When E reaches 
the scale ~ (1/ G)3 the amplitude reaches order unity and some new physics must 
take over just because the cross section is going to violate the unitarity bound from 
basic quantum mechanics. (Remember phase shift and all that?) 

In fact, what that something is goes back to Yukawa, who at the same time that 
he suggested the meson theory for the nuclear forces also suggested that an inter- 
mediate vector boson could account for the Fermi theory of the weak interaction. 
(In the 1930s the distinction between the strong and the weak interactions was far 
from clear.) Schematically, consider a theory of a vector boson of mass M coupled 
to a fermion field via a dimensionless coupling constant g: 


L= wiv"), —m)w — ZF FY + M7A,A* + gA,by"y (1) 


Let’s calculate fermion-fermion scattering. The Feynman diagram in Figure III.2.2 
generates an amplitude (—ig)* (#y“u)[i/(k* — M? + ié)|(uy,,u4), which when the 
momentum transfer k is much less than M becomes i(g?/M*)(iy"u) (ii Yyl). 
But this is just as if the fermions are interacting via a Fermi theory of the form 
Gaby’ W)Gby,W) with G = g?/M°. 

If we blithely calculate with the low energy effective theory G(vry"“w) (yr ¥u¥)s 
it cries out that itis going to fail. Yes sir indeed, at the energy scale (1/G) _ M/g, 
the vector boson is produced. New physics appears. 
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Figure III.2.2 


I find it sobering that theories in physics have the ability to announce their own 
eventual failure and hence their domains of validity, in contrast to theories in some 
other areas of human thought. 


Einstein’s theory is now crying out 


The theory of gravity is also notoriously nonrenormalizable. Simply comparing 
Newton’s law V(r) = Gy, M,M,/r with Coulomb’s V(r) = a/r we see that New- 
ton’s gravitational constant G,, has mass dimension —2. No more need be said. 
We come to the same morose conclusion that the theory of gravity, just like 
Fermi’s theory of weak interaction, is nonrenormalizable. To repeat the argument, 
if we calculate graviton-graviton scattering at energy E,, we encounter the series 
~ [1+ GyE? + (GyE’Y +--+) 

Just as in our discussion of the Fermi theory, the nonrenormalizability of 
quantum gravity tells us that at the Planck energy scale (1/G wy? = Mptanck ~ 
10!9m proton NeW physics must appear. Fermi’s theory cried out, and the new physics 
turned to be the electroweak theory. Einstein’s theory is now crying out. Will the 
new physics turn out to be string theory?! 


Exercise 
11.2.1. Consider the d-dimensional scalar field theory S = f d?x(1 (a9)? + Lm 9? + 


Agt+---+A,9"+---). Show that [g] = (d — 2)/2 and [A, ]= "(2 —d)/24+d. 
Note that g is dimensionless for d = 2. 


!J, Polchinski, String Theory. 


Chapter III.3 


Counterterms and Physical 
Perturbation Theory 


Renormalizability 


The heuristic argument of the previous chapter indicates that theories whose cou- 
pling has negative mass dimension are nonrenormalizable. What about theories 
with dimensionless couplings, such as quantum electrodynamics and the g* the- 
ory? As a matter of fact, both of these theories have been proved to be renormal- 
izable. But it is much more difficult to prove that a theory is renormalizable than 
to prove that it is nonrenormalizable. Indeed, the proof that nonabelian gauge the- 
ory (about which more later) is renormalizable took the efforts of many eminent 
physicists, culminating in the work of ’t Hooft, Veltman, B. Lee, Zinn-Justin, and 
many others. 

Consider again the simple g* theory. First, a trivial remark: The physical 
coupling constant A p is a function of sq, fg, and up [see (III.1.4)]. For theoretical 
purposes it is much less cumbersome to set 59, fp, and up equal to jz” and thus use, 
instead of (III.1.4), the simpler definition 


2 
~ikp = —id + 3iCA? log (*;) + OA) (1) 
B 


This is purely for theoretical convenience.! 


We saw that to order A” the meson-meson scattering amplitude when expressed 
in terms of the physical coupling A p is independent of the cutoff A. How do we 
prove that this is true to all orders in A? Dimensional analysis only tells us that 
to any order in A the dependence of the meson scattering amplitude on the cutoff 
must be a sum of terms going as [log(A/,)]? with some power p. 

The meson-meson scattering amplitude is certainly not the only quantity that 
depends on the cutoff. Consider the inverse of the g propagator to order A? as 
shown in Figure III.3.1. The Feynman diagram in Figure III.3.1a gives something 
like —id [“[d4q /(2x)*Ili/(q? — m? + ie)]. The precise value does not concern 


'Tn fact, the kinematic point sp = ty = ug = 2? cannot be reached experimentally. 
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(a) (b) 


Figure III.3.1 


us; we merely note that it depends quadratically on the cutoff A but not on k*. The 
diagram in Figure I11.3.1b involves a double integral 


d*p d‘q 
I A;A) = 
(k, m, Asa) = (—iay’ iy an Ox 


— er rr —_—____,__ (2) 
p2 — m* 4+ ie q? — m? Zhe pigaw —m*+1é 

Counting powers of p and q we see that the integral ~ {(d°P/P®) and so I 
depends quadratically on the cutoff A. 

By Lorentz invariance / is a function of k”, which we can expand in a series 
D+ Ek? + Fk’ +--+. The quantity D is just 7 with the external momentum k 
set equal to zero and so depends quadratically on the cutoff A. Next, we can obtain 
E by differentiating 7 with respect to k twice and then setting k equal to zero. This 
clearly decreases the powers of p and q in the integrand by 2 and so F depends 
only logarithmically on the cutoff A. Similarly, we can obtain F by differentiating 
I with respect to k four times and then setting k equal to zero. This decreases the 
powers of p and q in the integrand by 4 and thus F is given by an integral that 
goes as ~ f d®P/ P'° for large P. The integral is convergent and hence cutoff 
independent. We can clearly repeat the argument ad infinitum. Thus F and the 
terms in (-- -) are cutoff independent as the cutoff goes to infinity and we don’t 
have to worry about them. 

Putting it altogether, we have the inverse propagator k? — m? + a + bk? up 
to O(k*) with a and b, respectively, quadratically and logarithmically cutoff 
dependent. The propagator is changed to 


1 = 1 3) 
k2—m* = (14+ b)k* — (m2 — a) 
The pole in k? is shifted to m*> = m* + 6m? = (m? — a)(1+ b)~!, which we iden- 
tify as the physical mass. This shift is known as mass renormalization. Physically, 
it is quite reasonable that quantum fluctuations will shift the mass. 
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What about the fact that the residue of the pole in the propagator is no longer 1 
but (1+ b)—!9 

To understand this shift in the residue, recall that we blithely normalized the field 
gy so that £ = 1 (ag) +--+. That the coefficient of k? in the lowest order inverse 
propagator k* — m? is equal to 1 reflects the fact that the coefficient of 4(dg)? in 
£ is equal to J. There is certainly no guarantee that with higher order corrections 
included the coefficient of $(0 y)* in an effective & will stay at 1. Indeed, we see 
that it is shifted to (1 + b). For historical reasons, this is known as “wave function 
renormalization” even though there is no wave function anywhere in sight. A more 
modern term would be field renormalization. (The word renormalization makes 
some sense in this case, as we did normalize the field without thinking too much 
about it.) 

Incidentally, it is much easier to say “logarithmic divergent” than to say “log- 
arithmically dependent on the cutoff A,” so we will often slip into this more 
historical and less accurate jargon and use the word divergent. In g* theory, the 
wave function renormalization and the coupling renormalization are logarithmi- 
cally divergent, while the mass renormalization is quadratically divergent. 


Bare versus physical perturbation theory 


What we have been doing thus far is known as bare perturbation theory. We should 
have put the subscript 0 on what we have been calling gy, m, and A. The field g 
is known as the bare field, and mp and Ag are known as the bare mass and bare 
coupling, respectively. I did not put on the subscript 0 way back in Part I because 
I did not want to clutter up the notation before you, the student, even knew what a 
field was. 

Seen in this light, using bare perturbation theory seems like a really stupid thing 
to do, and it is. Shouldn’t we start out with a zeroth order theory already written 
in terms of the physical mass m p and physical coupling A p that experimentalists 
actually measure, and perturb around that theory? Yes, indeed, and this way of 
calculating is known as renormalized or dressed perturbation theory, or as I prefer 
to call it, physical perturbation theory. 

We write 


1 x 
L= 5 [ayy —m,9"]— ae + A(dy)* + By* + Cg (4) 


(A word on notation: The pedantic would probably want to put a subscript P 
on the field g, but let us clutter up the notation as little as possible.) Physical 
perturbation theory works as follows. The Feynman rules are as before, but with the 
crucial difference that for the coupling we use Ap and for the propagator we write 
i/(k? - m + ié) with the physical mass already in place. The last three terms 
in (4) are known as counterterms. The coefficients A, B, and C are determined 
iteratively (see later) as we go to higher and higher order in perturbation theory. 
They are represented as crosses in Feynman diagrams, as indicated in Figure 
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IlI.3.2, with the corresponding Feynman rules. All momentum integrals are cut 
off. 

Let me now explain how A, B, and C are determined iteratively. Suppose we 
have determined them to order an . Call their values to this order Ay, By, and 
Cy. Draw all the diagrams that appear in order aa +! We determine Ay 41, Bn ai, 
and Cy, by requiring that the propagator calculated to the order Po *Thasa pole 
at mp With a residue equal to 1, and that the meson-meson scattering amplitude 
evaluated at some specified values of the kinematic variables has the value —iA p. 
In other words, the counterterms are fixed by the condition that mp and Ap are 
what we say they are. Of course, A, B, and C will be cutoff dependent. Note that 
there are precisely three conditions to determine the three unknowns Ay ,;, By +1, 
and Cy 41. 

Explained in this way, you can see that it is almost obvious that physical 
perturbation theory works, that is, it works in the sense that all the physical 
quantities that we calculate will be cutoff independent. Imagine, for example, that 
you labor long and hard to calculate the meson-meson scattering amplitude to order 
Ap. It would contain some cutoff dependent and some cutoff independent terms. 
Then you simply add a contribution given by Cj, and adjust Cj7 to cancel the 
cutoff dependent terms. 

But ah, you start to worry. You say, “What if I calculate the amplitude for two 
mesons to go into four mesons, that is, diagrams with six external legs? If I get a 
cutoff dependent answer, then I am up the creek, as there is no counterterm of the 
form Dg? in (4) to soak up the cutoff dependence.” Very astute of you, but this 
worry is covered by the following power counting theorem. 


Degree of divergence 


Consider a diagram with B; external ¢ lines. First, a definition: A diagram is said 
to have a superficial degree of divergence D if it diverges as AP (A logarithmic 
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Figure I11.3.3 


divergence log A counts as D = 0.) The theorem says that D is given by 


I will give a proof later, but I will first illustrate what (5) means. For the inverse 
propagator, which has B; = 2, we are told that D = 2. Indeed, we encountered a 
quadratic divergence. For the meson-meson scattering amplitude, B; = 4, and so 
D =0, and indeed, it is logarithmically divergent. 

According to the theorem, if you calculate a diagram with six external legs (what 
is technically sometimes known as the six-point function), By = 6 and D = —2. 
The theorem says that your diagram is convergent or cutoff independent (i.e., the 
cutoff dependence disappears as A~). You didn’t have to worry. You should draw 
a few diagrams to check this point. Diagrams with more external legs are even 
more convergent. 

The proof of the theorem follows from simple power counting. In addition to 
By, and D, let us define B; as the number of internal lines, V as the number of 
vertices, and L as the number of loops. (It is helpful to focus on a specific diagram 
such as the one in Figure Iii.3.3 with Bf =6, D=—2, B, =5, V =4, and L =2.) 

The number of loops is just the number of {[d4k/(27)*] we have to do. Each 
internal line carries with it a momentum to be integrated over, so We seem to have 
B, integrals to do. But the actual number of integrals to be done is of course 
decreased by the momentum conservation delta functions associated with the 
vertices, one to each vertex. There are thus V delta functions, but one of them 
is associated with overall momentum conservation of the entire diagram. Thus, 
the number of loops is 


L=B,-(V—-1) (6) 


[If you have trouble following this argument, you should work things out for the 
diagram in Figure [1.3.3 for which this equation reads 2 = 5 — (4 — ]).] 
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For each vertex there are four lines coming out (or going in, depending on how 
you look at it). Each external line comes out of (or goes into} one vertex. Each 
internal line connects two vertices. Thus, 


4V = B, +2B, (7) 


(For Figure II1.3.3 this reads 4.4 =6+4 2-5.) 
Finally, for each loop there is a f d*k while for each internal line there is a 
i/(k? — m? + ie), bringing the powers of momentum down by 2. Hence, 


D=4L —2B, (8) 


(For Figure III.3.3 this reads -2 = 4-2 —2-5.) 
Putting (6), (7), and (8) together, we obtain the theorem (5).2 As you can plainly 
see, this formalizes the power counting we have been doing all along. 


Degree of divergence with fermions 


To test if you understood the reasoning leading to (5), consider the Yukawa theory 
we met in (1I.5.19). (We suppress the counterterms for typographical clarity.) 


L = wliy"d, —mp)v + 4109) — nee" l—-Ape + fpevy —9) 


Now we have to count F; and F;, the number of internal and external fermion 
lines, respectively, and keep track of Vy and V,, the number of vertices with the 
coupling f and A, respectively. We have five equations altogether. For instance, 
(7) splits into two equations because we now have to count fermion lines as well 
as boson lines. For example, we now have 


We finally obtain 
D=4-B,-—3F, (11) 


So the divergent amplitudes, that is, those classes of diagrams with D > 0, have 
(Br, Fr) = (0, 2), (2, 0), (1, 2), and (4, 0). We see that these correspond to 
precisely the six terms in the Lagrangian (9), and thus we need six counterterms. 

Note that this counting of superficial powers of divergence shows that all terms 
with mass dimension < 4 are generated. For example, suppose that in writing down 
the Lagrangian (9) we forgot to include the A pg* term. The theory would demand 


? The superficial degree of divergence measures the divergence of the Feynman diagram 
as all internal momenta are scaled uniformly by k > Ak with A tending to infinity. In a more 
rigorous treatment we have to worry about the momenta in some subdiagram (a piece of 
the full diagram) going to infinity with other momenta held fixed. 
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that we include this term: We have to introduce it as a counterterm [the term with 
(Br, F;) = (4, 0) in the list above]. 

A common feature of (5) and (11) is that they both depend only on the number 
of external lines and not on the number of vertices V. Thus, for a given number of 
external lines, no matter to what order of perturbation theory we go, the superficial 
degree of divergence remains the same. Further thought reveals that we are merely 
formalizing the dimension-counting argument of the preceding chapter. [Recall 
that the mass dimension of a Bose field [g] is 1 and of a Fermi field [y] is 3. 
Hence the coefficients 1 and 3 in (11).] 

Our discussion hardiy amounts to a rigorous proof that theories such as the 
Yukawa theory are renormalizable. If you demand rigor, you should consult the 
many field theory tomes on renormalization theory, and I do mean tomes, in which 
such arcane topics as overlapping divergences and Zimmerman’s forest formula 
are discussed in exhaustive and exhausting detail. 


Nonrenormalizable field theories 


It is instructive to see how nonrenormalizable theories reveal their unpleasant 
personalities when viewed in the context of this discussion. Consider the Fermi 
theory of the weak interaction written in a simplified form: 


L=liy"d, —mp)v + Gy)’. 


The analogs of (6), (7), and (8) now read L = F; —(V — 1), 4V =F,4 2F,, 
and D = 4L — F;. Solving for the superficial degree of divergence in terms of the 
external number of fermion lines, we find 


D=4-3F,+2V (12) 


Compared to the corresponding equations for renormalizable theories (5) and 
(11), D now depends on V. Thus if we calculate fermion-fermion scattering 
(F,; =4), for example, the divergence gets worse and worse as We go to higher and 
higher order in the perturbation series. This confirms the discussion of the previous 
chapter. But the really bad news is that for any F,, we would start running into 
divergent diagrams when V gets sufficiently large, so we would have to include 
an unending stream of counterterms (Ww)*, (Ww)*, (Ww), --:, each with an 
arbitrary coupling constant to be determined by an experimental measurement. 
The theory is severely limited in predictive power. 

At one time nonrenormalizable theories were considered hopeless, but they are 
accepted in the modern view based on the effective field theory approach, which I 
will discuss in Chapter VIIL.3. 
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Dependence on dimension 


The superficial degree of divergence clearly depends on the dimension d of space- 
time since each loop is associated with [ d¢k. For example, consider the Fermi 
interaction G(wy)* in (1+ 1)-dimensional spacetime. Of the three equations that 
went into (12) one is changed to D = 2L — F7, giving 


D=2-4Fr (13) 


the analog of (12) for 2-dimensional spacetime. In contrast to (12), V no longer 
enters, and the only superficial diagrams have F, = 2 and 4, which we can 
cancel by the appropriate counterterms. The Fermi interaction is renormalizable 
in (1 + 1)-dimensional spacetime. I will come back to it in Chapter VIL.4. 


The Weisskopf phenomenon 


I conclude by pointing out that the mass correction to a Bose field and to a Fermi 
field diverge differently. Since this phenomenon was first discovered by Weisskopf, 
I refer to it as the Weisskopf phenomenon. To see this go back to (11) and observe 
that B, and F, contribute differently to the superficial degree of divergence D. 
For B; =2, Fy =0, we have D = 2, and thus the mass correction to a Bose field 
diverges quadratically, as we have already seen explicitly [the quantity a in (3)]. 
But for F; =2, By =0, we have D = 1 and it looks like the fermion mass is 
linearly divergent. Actually, in 4-dimensional field theory we cannot possibly get 
a linear dependence on the cutoff. To see this, it is easiest to look at, as an example, 
the Feynman integral you wrote down for Exercise IL.5.1, for the diagram in Figure 
EE.5.1: 


d’k 1 p+k+m 


qj a _— = 2 2 
(2n)4 k? — pe? (p +k)? — m2 = A(p*) p + B(p*) (14) 


fyi? 


where I define the two unknown functions A(p*) and B(p”) for convenience. 
(For the purpose of this discussion it doesn’t matter whether we are doing bare 
or physical perturbation theory. If the latter, then I have suppressed the subscript 
P for the sake of notational clarity.) 

Look at the integrand for large k. You see that the integral goes as { A d*k( K/k*) 
and looks linearly divergent, but by reflection symmetry k — —k the integral to 
leading order vanishes. The integral in (14) is merely logarithmically divergent. 
The superficial degree of divergence D often gives an exaggerated estimate of 
how bad the divergence can be (hence the adjective “superficial”’). In fact, staring 
at (14) we can prove more, for instance, that B( p”) must be proportional to m. As 
an exercise you can show, using the Feynman rules given in Chapter H.5, that the 
same conclusion holds in quantum electrodynamics. 

For a boson, quantum fluctuations give 6. « A?/j, while for a fermion such 
as the electron, quantum correction to its mass 5m « m log(A/m) is much more 
benign. It is interesting to note that in the early twentieth century physicists thought 
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of the electron as a ball of charge of radius a. The electrostatic energy of such a 
ball, of the order e?/a, was identified as the electron mass. Interpreting 1/a as 
A, we could say that in classical physics the electron mass is proportional to A 
and diverges linearly. Thus, one way of stating the Weisskopf phenomenon is that 
“bosons behave worse than a classical charge, but fermions behave better.” 

As Weisskopf explained in 1939, the difference in the degree of the divergence 
can be understood heuristically in terms of quantum statistics. The “bad” behavior 
of bosons has to do with their gregariousness. A fermion would push away the 
virtual fermions fluctuating in the vacuum, thus creating a cavity in the vacuum 
charge distribution surrounding it. Hence its self-energy is less singular than would 
be the case were quantum statistics not taken into account. A boson does the 
opposite. 

The “bad” behavior of bosons will come back to haunt us later. 


Exercises 


1.3.1. Show that in (1 + 1)-dimensional spacetime the Dirac field y has mass dimension 
3, and hence the Fermi coupling is dimensionless. 


11.3.2. Derive (11) and (13). 


1.3.3. Show that B(p?) in (14) vanishes when we set m = 0. Show that the same behavior 
holds in quantum electrodynamics. 


IlI.3.4. We showed that the specific contribution (14) to dm is logarithmically divergent. 
Convince yourself that this is actually true to any finite order in perturbation 
theory. 


Chapter II.4 


Gauge Invariance: A Photon 
Can Find No Rest 


When the central identity blows up 


I explained in Chapter I.7 that the path integral for a generic field theory can be 
formally evaluated in what deserves to be called the Central Identity of Quantum 
Field Theory: 


J Doe 2@K PV O+IG <p V O/B) ) gh IKI (1) 


For any field theory we can always gather up all the fields, put them into one giant 
column vector, and call the vector g. We then single out the term quadratic in ¢ 
write itas jg - K -, andcall the rest V (y). Lam using a compact notation in which 
spacetime coordinates and any indices on the field, including Lorentz indices, are 
included in the indices of the formal matrix K . We will often use (1) with V =0: 


But what if K does not have an inverse? 
This is not an esoteric phenomenon that occurs in some pathological field 
theory, but in one of the most basic actions of physics, the Maxwell action 


S(A) = J d‘xL= J d‘x [34,0073 =a" a")A, + A, J". (3) 


The formal matrix K in (2) is proportional to the differential operator (@7g#” — 
a9") = OY”. A matrix does not have an inverse if some of its eigenvalues are 
zero, that is, if when acting on some vector, the matrix annihilates that vector. 
Well, observe that Q*” annihilates vectors of the form 0, A(x): "0, A(x) =0. 
Thus Q“” has no inverse. 

There is absolutely nothing mysterious about this phenomenon; we have already 
encountered it in classical physics. Indeed, when we first learned the concepts 
of electricity, we were told that only the “voltage drop” between two points has 
physical meaning. At a more sophisticated level, we learned that we can always add 
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any constant (or indeed any function of time) to the electrostatic potential (which 
is of course just “voltage”) since by definition its gradient is the electric field. At 
an even more sophisticated level, we see that solving Maxwell’s equation (which 
of course comes from just extremizing the action) amounts to finding the inverse 
Q7'. [In the notation I am using here Maxwell’s equation 0, Fe" = J” is written 
as O,,,A” = J”, and the solution is Av = (Q7')""J,,.] 

Well, Q—! does not exist! What do we do? We learned that we must impose an 
additional constraint on the gauge potential A“, known as “fixing a gauge.” 


A mundane nonmystery 


To emphasize the rather mundane nature of this gauge fixing problem (which some 
older texts tend to make into something rather mysterious and almost hopelessly 
difficult to understand), consider just an ordinary integral [7° dAe~4-*-4, with 
A = (a, b) a2-component vector and K = i . , , amatrix without an inverse. 


Of course you realize what the problem is: We have [7° [7°° da db e~® and 
the integral over b does not exist. To define the integral we insert into it a delta 
function 5(b — &). The integral becomes defined and actually does not depend 
on the arbitrary number €. More generally, we can insert 5[ f(b)] with f some 
function of our choice. In the context of an ordinary integral, this procedure is 
of course ludicrous overkill, but we will use the analog of this procedure in what 
follows. 

The necessity for imposing by hand a (gauge fixing) constraint in gauge theories 
can be seen from physical reasoning as well. In Chapter I.5 we sidestep this whole 
issue of fixing the gauge by treating the massive vector meson instead of the photon. 
In effect, we change QO” to (87 + m?)g#” — a", which does have an inverse 
(in fact We even found the inverse explicitly). 

Consider a massive vector meson moving along. By a Lorentz boost, we can 
always bring it to its rest frame, where we can apply what we learned about the 
rotation group, namely that a spin 1 meson has three spin states or, classically, 
polarization states. But if the vector meson is massless, we can no longer find 
a Lorentz boost that would bring us to its rest frame. A photon can find no 
rest! 

A massless spin 1 field is intrinsically different from a massive spin 1 field— 
that’s the crux of the problem. Now we have only rotations around the direction 
of motion of the photon, that is, O(2). The photon has only two polarization 
degrees of freedom. (You already learned in classical electrodynamics that an 
electromagnetic wave has two transverse degrees of freedom.) This is the true 
physical origin of gauge invariance. 

In this sense, gauge invariance is, strictly speaking, not a “real” symmetry but 
merely a reflection of the fact that we used a redundant description: a Lorentz 
vector field to describe two physical degrees of freedom. 
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Restricting the functional integral 


I will now discuss the method for dealing with this redundancy invented by Faddeev 
and Popov. As you will see presently, it is the analog of the method we used in 
our baby problem above. Even in the context of electromagnetism this method is 
a bit of overkill, but it will prove to be essential for nonabelian gauge theories (as 
we will see in Chapter VII.1) and for gravity. I will describe the method using a 
completely general and somewhat abstract language. In the next section, I will then 
apply the discussion here to a specific example. If you have some trouble with this 
section, you might find it helpful to go back and forth between the two sections. 

Suppose we have to do the integral J = [ DAe'*“); this can be an ordinary 
integral or a path integral. Suppose that under the transformation A -> A, the 
integrand and the measure do not change, that is, S(A) = S(A,) and DA = DA,. 
The transformations obviously form a group, since if we transform again with 
g’, the integrand and the measure do not change under the combined effect of 
g and g’ and A, -> (Ag): = Aga. We would like to write the integral 7 in the 
form I = (f Dg)J, with J independent of g. In other words, we want to factor 
out the redundant integration over g. Note that Dg is the invariant measure over 
the group of transformations and f Dg is the volume of the group. Be aware of 
the compactness of the notation in the case of a path integral: A and g are both 
functions of the spacetime coordinates x. 

I want to emphasize that this hardly represents anything profound or mysterious. 
If you have to do the integral I = f dx dy e'5-») with S(x, y) some function of 
x? + y, you know perfectly well to go to polar coordinates I = (f d0)J = (27) J, 
where J = f{ dr re'*\) is an integral over the radial coordinate r only. The factor 
2z is precisely the volume of the group of rotations in 2 dimensions. 

Faddeev and Popov showed how to do this “going over to polar coordinates” 
in a unified and elegant way. Following them, we first write the numeral “one” 
as 1 = A(A) f Dgs[f(A,)], an equality that merely defines A(A). Here f is 
some function of our choice and A(A), known as the Faddeev-Popov determi- 
nant, of course depends on f. Next, note that [A(A,) |"! = f Dgd[f(A,,)]= 
f Dg’ 5[f (Ay)]=[A(A)T', where the second equality follows upon defining 
g” = g’g and noting that Dg” = Dg. In other words, we showed that A(A) = 
A(A,) : the Faddeev-Popov determinant is gauge invariant. We now insert | into 
the integral J we have to do: 


1=| DAe'S(4) 
_ | DAeSAA(A) il Deal f(A,)I 


= il Dg il DAe'“) A(A)s f(A,)] (4) 


170 Hl. Renormalization and Gauge Invariance 


As physicists and not mathematicians, we have merrily interchanged the order of 
integration. 

At the physicist’s level of rigor, we are always allowed to change integration 
variables until proven guilty. So let us change A to A g—} then 


I= ( il De) | DAe A (A)8[ f (A)] (5) 


where we have used the fact that DA, S(A), and A(A) are all invariant under A > 
A gle 

That’s it. We’ve done it. The group integration ({ Dg) has been factored out. 

The volume of a compact group is finite, but in gauge theories there is a separate 
group at every point in spacetime, and hence ({ Dg) is an infinite factor. (This also 
explains why there is no gauge fixing problem in theories with global symmetries 
introduced in Chapter I.9.) Fortunately, in the path integral Z for field theory we 
do not care about overall factors in Z, as was explained in Chapter 1.3, and thus 
the factor ({ Dg) can simply be thrown away. 


Fixing the electromagnetic gauge 


Let us now apply the Faddeev-Popov method to electromagnetism. The transfor- 

mation leaving the action invariant is of course A,,—> A, — 3,A, so g in the 

present context is denoted by A and A, = A,, — 0,,A. Note also that since the in- 

tegral J we started with is independent of f it is still independent of f in spite of 

its appearance in (5). Choose f(A) = 0A — oo, where o is a function of x. In par- 

ticular, I is independent of o and so we can integrate J with an arbitrary functional 
; 4 2 

of o, in particular, the functional e~“/7*? f axa) 

We now turn the crank. First, we calculate 


[A(Ay- = | Dgd[ f(A, = il DAS(@A — 0°A —) (6) 
Next we note that in (5) A(A) appears multiplied by 5[_f(A)] and so in evaluating 
[A(A)]"! in (6) we can effectively set f(A) = 0A — oo to zero. Thus from (6) we 
have A(A) “=” [ f DA6(87A)]—1. But this object does not even depend on A, so 
we can throw it away. Thus, up to irrelevant overall factors that could be thrown 


away I is just f DAe’S\§(8A — a). 
Integrating over o (x) as we said we were going to do, we finally obtain 


Z= J Doe t/8) f Axo? il DAe'*45(9A — 0) 


_ il DAeiS(A-G/28) f d4x@ay? (7) 


Nifty trick by Faddeev and Popov, eh? 
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Thus, S(A) in (3) is effectively replaced by 


— §(A) — — 4 
Suyp(A) = (A) ~ 5 -| d*x(aAy? 


= fas {54, [ster — (1 — =) ava” Ay+ 4,1" (8) 


and Q4” by QUE = d7gH¥ — (1 — 1/€)d“a" or in momentum space Qf = 
—k*g!Y 4+ (1 — 1/&)k#k”, which does have an inverse. Indeed, you can check 


that 
kk, | 1 
ON. :|- Gxt | k2 = 5) 
Thus, the photon propagator can be chosen to be 
(—i) k,k 
a ess (1— {2 ” (9) 


in agreement with the conclusion in Chapter IL.7. 

While the Faddeev-Popov argument is a lot slicker, many physicists still prefer 
the explicit Feynman argument given in Chapter II.7. Ido, When we deal with the 
Yang-Mills theory and the Einstein theory, however, the Faddeev-Popov method 
is indispensable, as I have already noted. 


A reflection on gauge symmetry 


As we will see later and as you might have heard, much of the world beyond 
electromagnetism is also described by gauge theories. But as we saw here, gauge 
theories are also deeply disturbing and unsatisfying in some sense: They are 
built on a redundancy of description. The electromagnetic gauge transformation 
A, — A, — 4,,A is not truly a symmetry stating that two physical states have the 
same properties. Rather, it tells us that the two gauge potentials A,, and A,, — 0,,A 
describe the same physical state. In your orderly study of physics, the first place 
where A,, becomes indispensable is the Schrédinger equation, as I will explain in 
Chapter v4. Within classical physics, you got along perfectly well with just E 
and B. Some physicists are looking for a formulation of quantum electrodynamics 
without using A,,, but so far have failed to tum up an attractive alternative to what 
we have. It is conceivable that a truly deep advance in theoretical physics would 
involve writing down quantum electrodynamics without writing A,,. 


Chapter II.5 
Field Theory without Relativity 


Slower in its maturity 


Quantum field theory at its birth was relativistic. Later in its maturity, it found 
applications in condensed matter physics. We will have a lot more to say about the 
role of quantum field theory in condensed matter, but for now, we have the more 
modest goal of learning how to take the nonrelativistic limit of a quantum field 
theory. 

The Lorentz invariant scalar field theory 


L =(80')(a0) — moto — a(otoy? (1) 


(with A > 0 as always) describes a bunch of interacting bosons. It should certainly 
contain the physics of slowly moving bosons. For clarity consider first the rela- 
tivistic Klein-Gordon equation 


(a7 +m)o =0 (2) 


for a free scalar field. A mode with energy E = m + ¢ would oscillate in time as 
® « e7!** In the nonrelativistic limit, the kinetic energy ¢ is much smaller than 
the rest mass m. It makes sense to write ®(x, t) = e~!”" g(x, t), with the field y 
oscillating much more slowly than e~'”” in time. Plugging into (2) and using the 
identity (8/at)e—"” (- --} =e~'™ (im + 4/at)(- - -) twice, we obtain (—im + 
a/at)-y — V-¢ + m?y = 0. Dropping the term (37/3t)g as small compared to 
—2im(a/dt)g, we find Schrédinger’s equation, as we had better: 


i 9 =~ (3) 

By the way, the Kilein-Gordon equation was actually discovered before 
Schrédinger’s equation. 

Having absorbed this, you can now easily take the nonrelativistic limit of a 


quantum field theory. Simply plug 


®(F, 1) = Fee mo, t) (4) 
im 
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into (1) . (The factor 1/./2m is for later convenience.) For example, 


adiad sy 1 (: a II ( >) i 
— — —m’o'o + — im-+— —im + — —m 
at at >= || *) 9 ar) © ad 
+ 
1.{ tdg dg 
~ = —-— 5 
2 ¢ at at ” ©) 
After an integration by parts we arrive at 
i 
L =iglig — =—avlay — 8°(y'oy (6) 


where g* = A/4m?. 

As we saw in Chapter I.9 the theory (1) enjoys a conserved Noether current 
J, =i(*a,,® — 4, &'). The density Jo reduces to y'y, precisely as you would 
expect, while J; reduces to (i/2m)(g'd,@ — 8,g'g). When you first took a course 
in quantum mechanics, didn’t you wonder why the density p = g'g and the 
current J; = (i/2m)(y'a,g — 4,g'¢) look so different? As to be expected, various 
expressions inevitably become uglier when reduced from a more symmetric to a 
less symmetric theory. 


Number is conjugate to phase angle 


Let me point out some differences between the relativistic and nonrelativistic case. 

The most striking is that the relativistic theory is quadratic in time derivative, 
while the nonrelativistic theory is linear in time derivative. Thus, in the nonrela- 
tivistic theory the momentum density conjugate to the field y, namely 6£/ddg¢, is 
justi’, so that [g7 (x, t), o(x", t)} = —6(& — ¥’). In condensed matter physics 
it is often illuminating to write g = Jf pel? so that 


i J ] 
L = ~ayp — pad — — Boos + = ey — 3’ p° (7) 
2 2m 4p 


The first term is a total divergence. The second term tells us something of great 
importance! in condensed matter physics: in the canonical formalism (Chapter 
1.8), the momentum density conjugate to the phase field @(x) is 60/3096 = —p 
and thus Heisenberg tells us that 


[o(X, t), 0%’, Hp = id (F — x’ (8) 


' See P. Anderson, Basic Notions of Condensed Matter Physics, p. 235. 
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Integrating and defining N = f d?xp(x, t) = the total number of bosons, we find 
one of the most important relations in condensed matter physics 


[N, 0] =i (9) 


Number is conjugate to phase angle, just as momentum is conjugate to position. 
Marvel at the elegance of this! You would learn in a condensed matter course that 
this fundamental relation underlies the physics of the Josephson junction. 

You may know that a system of bosons with a “hard core” repulsion between 
them is a superfluid at zero temperature. In particular, Bogoliubov showed that the 
system contains an elementary excitation obeying a linear dispersion relation.” I 
will discuss superfluidity in Chapter V.1. 


The sign of repulsion 


In the nonrelativistic theory (7) it is clear that the bosons repel each other: Piling 
particles into a high density region would require an energy go. But itis less clear 
in the relativistic theory that A(t)? with A positive corresponds to repulsion. I 
outline one method in Exercise III.5.3, but here let’s just take a flying heuristic 
guess. The Hamiltonian (density) involves the negative of the Lagrangian and 
hence goes as 4(®*)? for large ® and would thus be unbounded below for A < 0. 
We know physically that a free Bose gas tend to condense and clump, and with 
an attractive interaction it surely might want to collapse. We naturally guess that 
A > 0 corresponds to repulsion. 

I next give you a more foolproof method. Using the central identity of quantum 
field theory we can rewrite the path integral for the theory in (1) as 


a= J DODeei f MxlGONG)—m? 01042007 O+(1/A)07] (10) 


Condensed matter physicists call the transformation from (1) to the Lagrangian 
L = (a0')(a@) — m?0'@ + 20 &'® + (1/4)o” the Hubbard-Stratonovich trans- 
formation. In field theory, a field that does not have kinetic energy, such as o,, is 
known as an auxiliary field and can be integrated out in the path integral. When 
we come to the superfield formalism in Chapter VIIL4, auxiliary fields will play 
an important role. 

Indeed, you might recall from Chapter III.2 how a theory with an intermediate 
vector boson could generate Fermi’s theory of the weak interaction. The same 
physics is involved here: The theory (10) in which the ® field is coupled to an 
“intermediate o boson” can generate the theory (1). 

If o were a “normal scalar field” of the type we have studied, that is, if the 
terms quadratic in o in the Lagrangian had the form 1(aa)" —1M 27, then its 


? For example, L.D. Landau and E. M. Lifschitz, Statisical Physics, p. 238. 
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propagator would be i/(k? — M? + ie). The scattering amplitude between two ® 
bosons would be proportional to this propagator. We learned in Chapter I.4 that 
the exchange of a scalar field leads to an attractive force. 

But o is not anormal field as evidenced by the fact that the Lagrangian contains 
only the quadratic term +(1/A)o7. Thus its propagator is simply i/(1/A) =iA, 
which (for 4 > 0) has a sign opposite to the normal propagator evaluated at low- 
momentum transfer i /(k? — M? +ie) ~ —i/M*. We conclude that o exchange 
leads to a repulsive force. 

Incidentally, this argument also shows that the repulsion is infinitely short 
ranged, like a delta function interaction. Normally, as we learned in Chapter I.4 
the range is determined by the interplay between the k* and the M? terms. Here 
the situation is as if the M? term is infinitely large. We can also argue that the 
interaction A(@')? involves creating two bosons and then annihilating them all 
at the same spacetime point. 


Finite density 


One final point of physics that people trained as particle physicists do not always 
remember: Condensed matter physicists are not interested in empty space, but want 
to have a finite density 6 of bosons around. We learned in statistical mechanics to 
add a chemical potential term jzg'y to the Lagrangian (6). Up to an irrelevant (in 
this context!) additive constant, we can rewrite the resulting Lagrangian as 


. 1 _ 
L =ig' dy — 5, BY div — (yy — py (11) 


Amusingly, mass appears in different places in relativistic and nonrelativistic 
field theories. To proceed further, I have to develop the concept of spontaneous 
symmetry breaking. Thus, adios for now. We will come back to superfluidity in 
due time. 


Exercises 


I1.5.1. Obtain the Klein-Gordon equation for a particle in an electrostatic potential 
(such as that of the nucleus) by the gauge principle of replacing (0/9r) in (2) 
by 0/dt —ieAg. Show that in the nonrelativistic limit this reduces to the 
Schrédinger’s equation for a particle in an external potential. 


11.5.2. Take the nonrelativistic limit of the Dirac Lagrangian. 


Ii1.5.3. Given a field theory we can compute the scattering amplitude of two particles 
in the nonrelativistic limit. We then postulate an interaction potential U(x) be- 
tween the two particles and use nonrelativistic quantum mechanics to calculate 
the scattering amplitude, for example in Born approximation. Comparing the 
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two scattering amplitudes we can determine U(x). Derive the Yukawa and the 
Coulomb potentials this way. The application of this method to the A(@i)* 
interaction is slightly problematic since the delta function interaction is a bit sin- 
gular, but it should be all right for determining whether the force is repulsive or 
attractive. 


Chapter III.6 
The Magnetic Moment of the Electron 


Dirac’s triumph 


I said in the preface that the emphasis in this book is not on computation, but how 
can I not tell you about the greatest triumph of quantum field theory? 

After Dirac wrote down his equation, the next step was to study how the electron 
interacts with the electromagnetic field. According to the gauge principle already 
used to write the Schrédinger’s equation in an electromagnetic field, to obtain the 
Dirac equation for an electron in an external electromagnetic field we merely have 
to replace the ordinary derivative d,, by the covariant derivative D,, =0,, —ieA,,: 


(iy"D,, — my =0 (1) 


Recall (I1.1.27). 

Acting on this equation with (iy“D, +m), we obtain —(y"y"D,D, + 
my = 0. We have y'y'’D,D, = sy", yyt+ty’, y’)D,D, = D,,D® ~ 
io’*D,,D, andio®"D,,D, = G/2)o""[D,, D,J= (e/2)o"" F.,,,. Thus, 


(D,0* _ 50 Fs + m?) yy =0 (2) 


Now consider a weak constant magnetic field pointing in the 3rd direction 
for definiteness, weak so that we can ignore the (A;)* term in (D,)’. By gauge 
invariance, we can choose Aj = 0, A, = —1Bx’, and Az = 1 Bx! (so that F\2 = 
@,A2 — 0,A; = B). As we will see, this is one calculation in which we really have 
to keep track of factors of 2. Then 

(D,)? = (8;)? — ie(a;A; + A;8;) + O(A7) 
ie 
= (3;)* — 2= Btx'dn — x74.) + OCA?) 
= V? —eB-% x p+ O(A?) (3) 


Note that we used 0; A; + A;0; = (0;A;) + 2A,0; = 2A;9;, where in (0; A;) the par- 
tial derivative acts only on A;. You may have recognized L = x x p as the orbital 
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angular momentum operator. Thus, the orbital angular momentum generates an 
orbital magnetic moment that interacts with the magnetic field. 

This calculation makes good physical sense. If we were studying the interaction 
of a charged scalar field ® with an external electromagnetic field we would start 
with 


(D,,D" + m’)@ =0 (4) 


obtained by replacing the ordinary derivative in the Klein-Gordon equation by 
covariant derivatives. We would then go through the same calculation as in (3). 
Comparing (4) with (2) we see that the spin of the electron contributes the addi- 
tional term (¢/2)o"" F,,. 

As in Chapter II.1 we write y = (?) in the Dirac basis and focus on 
@ since in the nonrelativistic limit it dominates x. Recall that in that basis 
ot) = glik (% ot). Thus (e/2)o+"F,,,, acting on @ is effectively equal to 


(e/2)03(F 1. — Fo) = (e/2)20°B = 2eB - § since $ = (G /2). Make sure you un- 

derstand all the factors of 2! Meanwhile, according to what I told you in chapter 

II.1, we should write ¢ =e7"”'W, where W oscillates much more slowly than 

e~'™! so that (96 + m)e—™ W ~ e—™ [_2im(3/dt)V]. Putting it all together, 
_we have 


[21m = - #8 E+25)| Vv =0 (5) 


There you have it! As if by magic, Dirac’s equation tells us that a unit of spin 
angular momentum interacts with a magnetic field twice as much as a unit of orbital 
angular momentum, an observational fact that had puzzled physicists deeply at the 
time. The calculation leading to (5) is justly celebrated as one of the greatest in the 
history of physics. 

The story is that Dirac did not do this calculation until a day after he discovered 
his equation, so sure was he that the equation had to be right. Another version is 
that he dreaded the possibility that the magnetic moment would come out wrong 
and that Nature would not take advantage of his beautiful equation. 

Another way of seeing that the Dirac equation contains a magnetic moment is 
by the Gordon decomposition, the proof of which is given in an exercise: 


/ Mh tee py! — 
i(p')y“u(p) = a(p’) —— + ermal u(p) 6) 
m 2m 


Looking at the interaction with an electromagnetic field a(p)y"u(p)A,,(p' ~ p), 
we see that the first term in (6) only depends on the momentum (p’ + p)“ and 
would have been there even if we were treating the interaction of a charged scalar 
particle with the electromagnetic field to first order. The second term involves 
spin and gives the magnetic moment. One way of saying this is that z(p’)y“u(p) 
contains a magnetic moment component. 
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The anomalous magnetic moment 


With improvements in experimental techniques, it became clear by the late 1940’s 
that the magnetic moment of the electron was larger than the value calculated by 
Dirac by a factor of 1.00118 + 0.00003. The challenge to any theory of quantum 
electrodynamics was to calculate this so-called anomalous magnetic moment. As 
you probably know, Schwinger’s spectacular success in meeting this challenge 
established the correctness of relativistic quantum field theory, at least in dealing 
with electromagnetic phenomena, beyond any doubt. 

Before we plunge into the calculation, note that Lorentz invariance and current 
conservation tell us (see Exercise II.6.3) that the matrix element of the electro- 
magnetic current must have the form (here |p, s) denotes a state with an electron 
of momentum p and polarization s) 


wy 
(p', s"| J*(0) |p, s) = a(p', 5’) Pace + et Fq?)| u(p,s) (7) 


where g =(p’ — p). The functions F,(g7) and F>(q*), about which Lorentz 
invariance can tell us nothing, are known as form factors. To leading order in 
momentum transfer g, (7) becomes 


/ jt + og [LV 
ai(p', 5" {PHP FO + rR + F,0)| u(p, 8) 


by the Gordon decomposition. The coefficient of the first term is the electric charge 
observed by experimentalists and is by definition equal to 1. Thus F,(0) = 1. 
The magnetic moment of the electron is shifted from the Dirac value by a factor 
1+ F,(0). 


Schwinger’s triumph 


Let us now calculate F,(0) to order w = e”/ 47. First draw all the relevant Feynman 
diagrams to this order (Fig. II.6.1). Except for Figure 1b, all the Feynman diagrams 
are clearly proportional to“(p’, s‘)y"u(p, s) and thus contribute to F- (q7), which 
we don’t care about. Happy are we! We only have to calculate one Feynman 
diagram. 

It is convenient to normalize the contribution of Figure 1b by comparing it to the 
lowest order contribution of Figure 1a and write the sum of the two contributions 
as u(y" + T*)u. Applying the Feynman rules, we find 


d’k -i i i 
rH = = fjeyp_1 yw ty 8 
(2n)4 2 (ier B+ k—m Eni) 8) 


I will now go through the calculation in some detail not only because it is 
important, but also because we will be using a variety of neat tricks springing 
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Figure II1.6.1 


from the brilliant minds of Schwinger and Feynman. You should verify all the 


steps of course. 
Simplifying somewhat we obtain [“ = —ie* f[d*k/(27)*](N“/D), where 


NY sy'(p+ K+my"(b+ k-m)y, (9) 


and 


1 1 
5 ip pC =? | ewaas. (10) 
D (p' +k? — ml (pth? —m? kh? D 
We have used the identity (D.16). The integral is evaluated over the triangle in the 
(a-B) plane bounded by a = 0, 6 = 0, anda + f = 1, and 


D =[k? + 2k(ap’ + Bp)P = [I? — (@ + B)*m’P + O@?) (11) 


where we completed a square by defining k =/ — (ap’ + Bp). 
Our strategy is to massage N” into a form consisting of a linear combination 
of y", p”, and p’’. Invoking the Gordon decomposition (6) we can write (7) as 


_ 1_, 
u {ytFc@? + Fy(q*)]— om P+ pray} u 


Thus, to extract F,(0) we can throw away without ceremony any term proportional 
to y” that we encounter while massaging N”. So, let’s proceed. 
Eliminating k in favor of / in (9) we obtain 


No =y T+ Pi+mly*l/+ P+miy, (12) 


where P’# = (1— a) p’* — Bp" and P# = (1— B)p" — ap”. I will use the iden- 
tities in Appendix D repeatedly, without alerting you every time I use one. It is 
convenient to organize the terms in N” by powers of m. (Here I give up writing in 
complete grammatical sentences.) 


1. The m? term: a y” term, throw away. 
2. The m terms: organize by powers of /. The term linear in / integrates to 0 by 
symmetry. Thus, we are left with the term independent of /: 
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my” Plyy, +y°y” Py,) =4m[(l — 2a) p™ + (1-28) p*] 
— 4m(i ~ a — B)(p'+ p)* (13) 


In the last step I used a handy trick; since D is symmetric under a <—> £, 
we can symmetrize the terms we get in N’, 

3. Finally, the most complicated m° term. The term quadratic in /: note that 
we can effectively replace [°/* inside f d41/(2x)* by 1n°*l* by Lorentz 
invariance (this step is possible because we have shifted the integration 
variable so that D is a Lorentz invariant function of /*.) Thus, the term 
quadratic in / gives rise to a y* term. Throw it away. Again we throw away 
the term linear in /, leaving [use (D.6) here!] 


v’ Py! Py, =—2 Py" Pp’ 
— —2[(1— B) p—am]y"[G—a@) p’— Bm] (14) 


where in the last step we remembered that I" is to be sandwiched between 
u(p’) and u(p). Again, it is convenient to organize the terms in (14) by 
powers of m. With the various tricks we have already used, we find that the 
m” term can be thrown away, the m term gives 2m(p’ + p)“[a(i — a) + 
B(i — B)], and the m® term gives 2m(p’ + p)“[—2(1 — w)(1 —~ B)]. Putting 
it altogether, we find that N“ + 2m(p’ + p)"(a + B)(1—a@ — B) 


We can now do the integral f[d*1/(27)*](1/D) using (D.11). Finally, we obtain 


ri =—2ie? f dad 7_)____y# 
PT (a + B)?m? 
e 1, 
——. —(p'+ p)* 15 
822 Im P +P? (15) 
and thus, trumpets please: 
2 
é af 
F,(0) = — = — 16 
2(0) 2 On (16) 


Schwinger’s announcement of this result in 1948 had an electrifying impact on the 
theoretical physics community. 

I gave you in this chapter not one, but two, of the great triumphs of twentieth 
century physics, although admittedly the first is not a result of field theory per se. 


Exercises 


11.6.1. Evaluate u(p’)( p’y" + y" p)u(p) in two different ways and thus prove Gordon 
decomposition. 
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Check that (7) is consistent with current conservation. [Hint: By translation 
invariance (we suppress the spin variable) 


(p'| J¥(x) |p) = (p'| J¥(0) {p)el(P—P) 
and hence 
(p' 8, J*(x) |p) =i(p' — Pp), (p'| JMO) |pyel?—P* 


Thus current conservation implies that g# (p’| J“(0) ip) =0.] 


By Lorentz invariance the right hand side of (7) has to be a vector. The only 
possibilities are wy"u, (p + p’)“iu, and (p — p’)“iiu. The last term is ruled out 
because it would not be consistent with current conservation. Show that the form 
given in (7) is in fact the most general allowed. 


Chapter III.7 


Polarizing the Vacuum and 
Renormalizing the Charge 


A photon can fluctuate into an electron and a positron 


One early triumph of quantum electrodynamics is the understanding of how quan- 
tum fluctuations affect the way the photon propagates. A photon can always meta- 
morphose into an electron and a positron that, after a short time mandated by the 
uncertainty principle, annihilate each other becoming a photon again. The process, 
which keeps on repeating itself, is depicted in Figure II.7.1. 

Quantum fluctuations are not limited to what we just described. The electron 
and positron can interact by exchanging a photon, which in turn can change into an 
electron and a positron, and so on and so forth. The full process is shown in Figure 
IlI.7.2, where the shaded object, denoted by T1,,,(q) and known as the vacuum 
polarization tensor, is given by an infinite number of Feynman diagrams, as shown 
in Figure III.7.3. Figure I1.7.1 is obtained from Figure III.7.2 by approximating 
iTl,,,(q) by its lowest order diagram. 

It is convenient to rewrite the Lagrangian £ = vliy" (a, — 1eA,) — my 
—1F,,,F” by letting A > (1/e)A, which we are always allowed to do, so that 


_ i 
£ = wliy"(@, —iA,) — mW — 25 FyyP™ (1) 


Note that the gauge transformation leaving £ invariant is given by yw > ely 
and A,, > A, + 9,a. The photon propagator (Chapter III.4), obtained roughly 
speaking by inverting (1/4e7) F uv #", is now proportional to e?: 


(2) 


. —1eé 2 Gudv 

iDiyv@) = Po & —(d-&) ¢ 
Every time a photon is exchanged, the amplitude gets a factor of e”. This is just a 
trivial but convenient change and does not affect the physics in the slightest. For ex- 
ample, in the Feynman diagram we calculated in Chapter II.6 for electron-electron 
scattering, the factor e” can be thought of as being associated with the photon prop- 
agator rather than as coming from the interaction vertices. In this interpretation 
e” measures the ease with which the photon propagates through spacetime. The 
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smaller e*, the more action it takes to have the photon propagate, and the harder 
for the photon to propagate, the weaker the effect of electromagnetism. 

The diagrammatic proof of gauge invariance given in Chapter II.7 implies that 
q"T1,,,(¢) = 0. Together with Lorentz invariance, this requires that 


I], () = CACE _ Sud )T(q?) (3) 


The physical or renormalized photon propagator as shown in Figure II.7.1 is 
then given by the geometric series 


iD), @) =i Dy @) + iDyx @iMp(@)iD pv) 
+ ED, (qi, @)i Doo (Q)iNgn (Qi Div (q) te 


+ 2 
—teé 
= syy{l — eM’) + [e°Mq”)P +---}+4,4, term 
q 


ie? 


1 
—>— > + term 4 
7 Set en @? Id (4) 
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Because of (3) the (1 — §)(¢,.9,/ q’) part of D,,,(q) is annihilated when it encoun- 
ters 11,,(q). Thus, ini D/’ (q) the gauge parameter enters only into the q,,q, term 
and drops out in physical amplitudes, as explained in Chapter II.7. 
. . . P . - . 
The residue of the pole in iD) »(4) is the physical or renormalized charge 
squared: 


1 

2 2 

= — 5 
er =e 1+ e211(0) @) 


Respect for gauge invariance 


In order to determine é€p in terms of e, let us calculate to lowest order 


a d*p yy i pi ) 
iT (a) = ( ) f <2 (ir Sa Tom ©) 


For large p the integrand goes as 1/p* with a subleading term going as m”/p* 
causing the integral to have a quadratically divergent and a logarithmically diver- 
gent piece. (You see, it is easy to slip into bad language.) Not a conceptual problem 
at all, as I explained in Chapter III.1. We simply regularize. But now there is a del- 
icate point: Since gauge invariance plays a crucial role, we must make sure that 
our regularization respects gauge invariance. 

In the Pauli-Villars regularization (III.1.13) we replace (6) by 


id= | as "(pe gon Gow) 
HTT, y(q) = ( | tr [ly p+d—m’ p—m 


— Seat (ix" ir om )] (7) 


Now the integrand goes as (1 — }° c,)(1/ p*) with a subleading term going as 
(m? — >>, cgm?)(1/p*), and thus the integral would converge if we choose c, 
and m,, such that 


> Cg=1 (8) 
and 


Yo cg, =m’ (9) 


Clearly, we have to introduce at least two regulator masses. We are confessing to 
ignorance of the physics above the mass scale m,. The integral in (7) is effectively 
cut off when the momentum p exceeds m,. 
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The gauge invariant form (3) we expect to get actually suggests that we need 
fewer regulator terms than we think. Imagine expanding (6) in powers of q. Since 


1,,,(q) = (Gudv —~ 8rd ITI (0) ++] 


we are only interested in terms of O(q) and higher in the Feynman integral. If 
we expand the integrand in (6), we see that the term of O(q) goes as 1/p* for 
large p, thus giving a logarithmically divergent (speaking bad language again!) 
contribution. (Incidentally, you may recall that this sort of argument was also used 
in Chapter III.3.) It seems that we need only one regulator. This argument is not 
rigorous because we have not proved that II1(q”) has a power series expansion in 
q’, but instead of worrying about it let us proceed with the calculation. 

Once the integral is convergent, the proof of gauge invariance given in Chapter 
II.7 now goes through. Let us recall briefly how the proof went. In computing 
q'¥1(q) we use the identity 


ie ny ee ne 
p+d-m”" pm pom p+d-m 


to split the integrand into two pieces that cancel upon shifting the integration 
variable p + p +q. Recall from Exercise (1.7.2) that we were concerned that 
in some cases the shift may not be allowed, but it is allowed if the integral is 
sufficiently convergent, as is indeed the case now that we have regularized. In any 
event, the proof is in the pudding, and we will see by explicit calculation that 
II, ,(q) indeed has the form in (3). 

Having learned various computational tricks in the previous chapter you are now 
ready to tackle the calculation. I will help by walking you through it. In order not 
to clutter up the page I will suppress the regulator terms in (7) in the intermediate 
steps and restore them toward the end. After a few steps you should obtain 


d*p Nuv 
(2x)4 D 


iT1,,,(q) =-1 

where N,,, =ttly,( b+ ¢ +m)y,( p —m)] and 
1 | 
a= 5 


with D = [/? +a(1—a)q* —m*+ie}, wherel = p + aq. Eliminating p in favor 
of / and beating on N.,,, you will find that N.,,, is effectively equal to 


] 
—4 (Feu! + a(1 _ a) (24,4, ~~ Bu) ~~ m4) 


Integrate over / using (D.12) and (D.13) and, writing the contribution from the 
regulators explicitly, obtain 
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1 1 
1 v(q) = eo) [ da Fai — » ct) (10) 
where 
Fy(m) 
= 1, {4° — 2[m? —a(l- a)q’] log ME + m* — a(1— a4? | 
qo" m? — a(1—«a)q? 
2 
— fa(l—@)(29,4y — Burd?) tm’ ey] tog m —a(l—a)q? — 1 (11) 


Remember, you are doing the calculation; I am just pointing the way. In Appendix 
D A was introduced to give meaning to various divergent integrals. Since our 
integral is convergent, we should not need A, and indeed, it is gratifying to see 
that in (10) A drops out thanks to the conditions (8) and (9). Some other terms 
drop out as well, and we end up with 


1 1 
I (q@) = mera ILL — Buvd) [ da a(1—a@) 


(loglm? — a(1 ~@)q7]— } °c, logim? ~a(1—a)g7} (12) 


Lo and behold! The vacuum polarization tensor indeed has the form IT,,,(¢) = 

udv — 8uvd M(q?). Our regularization scheme does respect gauge invariance. 

For g” < m? (the kinematic regime we are interested in had better be much 

lower than our threshold of ignorance) we simply define log M* = )° c, log m2 
a 


in (12) and obtain 


1 f' M? 
M(g2) = —- | da a(1—a) log ———__ (13) 
2n* Jo m2 —a(1—a)q? 


Note that our heuristic argument is indeed correct. In the end, effectively we need 
only one regulator, but in the intermediate steps we needed two. Actually, this 
bickering over the number of regulators is beside the point. 

In Chapter IEI.1 I mentioned dimensional regularization as an alternative to 
Pauli-Villars regularization. Historically, dimensional regularization was invented 
to preserve gauge invariance in nonabelian gauge theories (which I will discuss in 
a later chapter). It is instructive to calculate TI using dimensional regularization 
(Exercise IEL7.1). 
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(a) (b) (<) 


Figure ITI.7.4 


Electric charge 


Physically, we end up with a result for I(q*) containing a parameter M? express- 
ing our threshold of ignorance. We conclude that 


@ae@—__} dng (1- : lo *) (14) 
Ro 1+ (€2/122?) log(M2/m2° Lon?” in? 


Quantum fluctuations effectively diminish the charge. I will explain the physical 
origin of this effect in a later chapter on renormalization group flow. 

You might argue that physically charge is measured by how strongly one elec- 
tron scatters off another electron. To order e* , in addition to the diagrams in Chap- 
ter 11.7, we also have, among others, the diagrams shown in Figure II.7.4a,b,c. We 
have computed 4a, but what about 4b and 4c? In many texts, it is shown that con- 
tributions of II.7.4b and III.7.4c to charge renormalization cancel. The advantage 
of using the Lagrangian in (1) is that this fact becomes self-evident: Charge is a 
measure of how the photon propagates. 

To belabor a more or less self-evident point let us imagine doing physical or 
renormalized perturbation theory as explained in chapter IH.3. The Lagrangian 
is written in terms of physical or renormalized fields (and as before we drop the 
subscript P on the fields) 


rae . 1 v 
L=wiy"(@, —iA,) —mp)v — — Fy Fe 
4e> 


+ Apiy"(8, —iA,)v + BU — CF) FY (15) 


where the coefficients of the counterterms A, B, and C are determined iteratively. 
The point is that gauge invariance guarantees that fiy"d,,p and py"A,,y always 
occur in the combination piy“(8,, —iA,,)y: The strength of the coupling of A,, 
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to Wy" cannot change. What can change is the ease with which the photon 
propagates through spacetime. 

This statement has profound physical implications. Experimentally, it is known 
to a high degree of accuracy that the charges of the electron and the proton are 
opposite and exactly equal. If the charges were not exactly equal, there would be a 
residual electrostatic force between macroscopic objects. Suppose we discovered a 
principle that tells us that the bare charges of the electron and the proton are exactly 
equal (indeed, as we will see, in grand unification theories, this fact follows from 
group theory.) How do we know that quantum fluctuations would not make the 
charges slightly unequal? After all, the proton participates in the strong interaction 
and the electron does not and thus many more diagrams would contribute to 
the long range electromagnetic scattering between two protons. The discussion 
here makes clear that this equality will be maintained for the obvious reason that 
charge renormalization has to do with the photon. In the end, it is all due to gauge 
invariance. 


Analyticity 


Consider the function F1(s) defined by (13). By analytic continuation it can be 
defined in the entire complex s plane. It is conventional to define log z as having a 
cut on the negative real axis, and thus the integrand in (13) has a cut on the positive 
real g?-axis extending from q* = m?/a(1— a) to infinity. Since in the range of 
integration the maximum value of a(1 — a) is }, TI (q’) is an analytic function in 
the complex q? plane with a cut along the real axis starting from q? =4m?. 

Physically, consider a virtual photon propagating along with momentum g# = 
(/@2, 0). For /q2 > 2m, there is enough energy for it to actually become a real 
electron-positron pair, not just a virtual pair that fluctuates out of existence almost 
immediately. That is evidently the physical origin of the cut. (If you have done the 
exercises religiously, you will realize that these points were already developed in 
Exercises 1.7.5 and III.1.2.) 

Considering how amplitudes are calculated in quantum field theory, as integrals 
over products of propagators, it is more or less clear that amplitudes are analytic 
functions of the external kinematic variables. One example is the scattering am- 
plitude M in Chapter III.1: It is manifestly an analytic function of s, ¢, and u 
with various cuts. From the late 1950s till the early 1960s considerable effort was 
devoted to studying the analyticity properties of Feynman diagrams.! 


' A technical aside: Restoring the ie in the propagators we dropped along the way, we 
see that in the integral representation for I1(q) we should write m? — ie. This instructs us 
to evaluate I1(q) slightly above the cut for g? > 4m’. 
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Modifying the Coulomb potential 


We have focused on charge renormalization, which is determined completely 
by T1(0), but in (13) we obtained the complete function TI(g*), which tells us 
how the g dependence of the photon propagator is modified. According to the 
discussion in Chapter I.5, the Coulomb potential is just the Fourier transform of 
the photon propagator (see also Exercise HI.5.3). Thus, the Coulomb interaction 
is modified from the venerable 1/r law at a distance scale of the order of (2m), 
namely the inverse of the characteristic value of g in T1(q’). This modification was 
experimentally verified as part of the Lamb shift in atomic spectroscopy, another 
great triumph of quantum electrodynamics. 


Exercises 


11.7.1. Calculate I,,,,(q) using dimensional regularization. The procedure is to start with 
(6), evaluate the trace in N,,, shift the integration momentum from p to/, and so 
forth, proceeding exactly as in the text, until you have to integrate over the loop 
momentum /. At that point you “pretend” that you are living in d-dimensional 
spacetime, so that the term like /,,/, in N,,,, for example, is to be effectively 
replaced by (1/d)g,,,/?. The integration is to be performed using (III.1.16) and 
various generalizations thereof. Show that the form (3) automatically emerges 


when you continue to d = 4. 


1.7.2. Study the modified Coulomb’s law as determined by the Fourier integral 
f@qyPu+en@ ype. 


PART IV 


SYMMETRY 
AND 
SYMMETRY BREAKING 


Chapter IV. 1 
Symmetry Breaking 


A symmetric world would be dull 


While we would like to believe that the fundamental laws of Nature are symmetric, 
a completely symmetric world would be rather dull, and as a matter of fact, the 
real world is not perfectly symmetric. More precisely, we want the Lagrangian, 
but not the world described by the Lagrangian, to be symmetric. Indeed, a central 
theme of modern physics is the study of how symmetries of the Lagrangian can be 
broken. We will see in subsequent chapters that our present understanding of the 
fundamental laws is built upon an understanding of symmetry breaking. 
Consider the Lagrangian studied in Chapter I.9: 


_— 1p as2 252] A 22,2 
b= 5 | 0d — we] -2@) () 


where ¢ = (9), @,°* +, Qy). This Lagrangian exhibits an O(N) symmetry under 
which @ transforms as an N-component vector. 

We can easily add terms that do not respect the symmetry. For instance, add 
terms such as 97, y} and y7¢" and break the O(N) symmetry down to O(N — 1), 
under which 95, ..., @,y rotate as an (N — 1)-component vector. This way of 
breaking the symmetry, “by hand” as it were, is known as explicit breaking. 

We can break the symmetry in stages. Obviously, if we want to, we can break 
it down to O(N — M) by hand, for any M < N. 

Note that in this example, with the terms we added, the reflection symmetry 
Yq —> —Y, (any a) still holds. It is easy enough to break this symmetry as well, by 
adding a term such as 3, for example. 

Breaking the symmetry by hand is not very interesting. Indeed, we might as 
well start with a nonsymmetric Lagrangian in the first place. 


Spontaneous symmetry breaking 
A more subtle and interesting way is to let the system “break the symmetry itself,” 
a phenomenon known as spontaneous symmetry breaking. I will explain by way 
of an example. Let us flip the sign of the ¢” term in (1) and write 
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Viq) 


Figure I'V.1.1 


1 ~ + A. 
=; [a +0 |-2@Y Q) 


Naively, we would conclude that for small 1 the field g creates a particle of mass 
¥ —u2 =i. Something is obviously wrong. 

The essential physics is analogous to what would happen if we give the spring 
constant in an anharmonic oscillator the wrong sign and write L = 1(g* + kq?) — 
(4./4)q*. We all know what to do in classical mechanics. The potential energy 
V(q) = —}kq? + (A/4)q‘ [known as the double-well potential (Figure IV.1.1)] has 


two minima at g = +v, where v = (k/A) 2. At low energies, we choose either one 
of the two minima and study small oscillations around that minimum, Committing 
to one or the other of the two minima breaks the reflection symmetry g — —q of 
the system. 

In quantum mechanics, however, the particle can tunnel between the two min- 
ima, the tunneling barrier being V(0) — V(+v). The probability of being in one 
or the other of the two minima must be equal, thus respecting the reflection sym- 
metry g — ~—gq of the Hamiltonian. In particular, the ground state wave function 
¥(q) = (—@q) is even. 

Let us try to extend the same reasoning to quantum field theory. For a generic 
scalar field Lagrangian & = 1(d9y)* — 4(8,g)? — V(@) we again have to find 
the minimum of the potential energy { d?x[4(a,g)* + V(g)], where D is the 
dimension of space. Clearly, any spatial variation in g only increases the energy, 
and so we set g(x) to equal a spacetime independent quantity g and look for the 
minimum of V (g). In particular, for the example in (2), we have 


— 1 an | A o2,2 
Vg)= Ghee +7@) (3) 


As we will see, the N = 1 case is dramatically different from the N > 2 cases. 
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Difference between quantum mechanics and quantum field theory 


Study the N = 1 case first. The potential V(g) looks exactly the same as the 
potential in Figure IV.1.1 with the horizontal axis relabeled as yg. There are two 
minima at g = +v = +(u2/a)?. 

But some thought reveals a crucial difference between quantum field theory and 
quantum mechanics. The tunneling barrier is now [V(0) — V(+v)] f d?x (where 
D denotes the dimension of space) and hence infinite (or more precisely, exten- 
sive with the volume of the system)! Tunneling is shut down, and the ground state 
wave function is concentrated around either +-v or —v. We have to commit to one 
or the other of the two possibilities for the ground state and build perturbation 
theory around it. It does not matter which one we choose: The physics is equiv- 
alent. But by making a choice, we break the reflection symmetry g > —g of the 
Lagrangian. 

The reflection symmetry is broken spontaneously! We did not put symmetry 
breaking terms into the Lagrangian by hand but yet the reflection symmetry is 
broken.! 

Let’s choose the ground state at +v and write g = v + g’. Expanding in gy’ we 
find after a bit of arithmetic that 


us 1 2 2/2 B 
ha +59) ue” — O|@ >) (4) 
The physical particle created by the shifted field y’ has mass \/2y. The physical 
mass squared has to come out positive since, after all, it is just V” (P)lp=v» as you 
can see after a moment’s thought. 

Similarly, you would recognize that the first term in (4) is just V(9)|,—,. If 
we are only interested in the scattering of the mesons associated with ¢’ this term 
does not enter at all. Indeed, we are always free to add an arbitrary constant to & 
to begin with. We had quite arbitrarily set V (g = 0) equal to 0. The same situation 
appears in quantum mechanics: In the discussion of the harmonic oscillator the 
zero point energy 44 is not observable; only transitions between energy levels 
are physical. We will return to this point in Chapter VIII.2. 

Yet another way of looking at (2) is that quantum field theory amounts to doing 
the Euclidean functional integral 


z= J Doyen f d*xthl@eY wo" 307} 


and perturbation theory just corresponds to studying the small oscillations around 
a minimum of the Euclidean action. Normally, with the sign of gy? in £ negative, 


! An insignificant technical aside for the nitpickers: Strictly speaking, in field theory the 
ground state wave function should be called a wave functional, since Vg (x)]is a functional 
of the function g(x). 
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we expand around the minimum g = 0. With the sign of y” negative, g = 0 is a 
local maximum and not a minimum. 

In quantum field theory what is called the ground state is also known as the 
vacuum, since it is literally the state in which the field is “at rest,” with no particles 
present. Here we have two physically equivalent vacua from which we are to choose 
one. The value assumed by ¢ in the ground state, either v or —v in our example, is 
known as the vacuum expectation value of y. The field g is said to have acquired 
a vacuum expectation value. 


Continuous symmetry 


Let us now turn to (2) with N > 2. The potential (3) is shown in Figure IV.1.2 
for N = 2. The shape of the potential has been variously compared to the bottom 
of a punted wine bottle or a Mexican hat. The potential is minimized at ¢2 = 
2/4. Something interesting is going on: We have an infinite number of vacua 
characterized by the direction of ¢ in that vacuum. Because of the O(2) symmetry 
of the Lagrangian they are all physically equivalent. The result had better not 
depend on our choice. So let us choose ¢ to point in the 1 direction, that is, 
g) =v=+/pu"/d and g = 0. 

Now consider fluctuations around this field configuration, in other words, write 
~, =v +g; and g2 = 5, plug into (2) for N = 2, and expand £ out. I invite you 
to do the arithmetic. You should find (after dropping the primes on the fields; why 
clutter the notation, right?) 


4 

1 

&=4 4-1 (ap)? + (82)"] - n79? + 0) 5) 
4, 2 

The constant term is exactly as in (4), and just like the field g’ in (4), the field ¢, 

has mass ./2j. But now note the remarkable feature of (5): the absence of a 3 

term. The field @2 is massless! 


Emergence of massless boson 


That g2 Comes out massless is not an accident. I will now explain that the mass- 
lessness is a general and exact phenomenon. 

Referring back to Figure [V.1.2 we can easily understand the particle spectrum. 
Excitation in the g, field corresponds to fluctuation in the radial direction, “‘climb- 
ing the wall’ so to speak, while excitation in the g field corresponds to fluctuation 
in the angular direction, “rolling along the gutter’ so to speak. It costs no energy for 
a marble to roll along the minima of the potential energy, going from one minimum 
to another. Another way of saying this is to picture a long wavelength excitation of 
the form g) = a sin(wt — kx) with a small. In a region of length scale small com- 
pared to [k| —! the field g, is essentially constant and thus the field ¢ is just rotated 
slightly away from the 1 direction, which by the O(2) symmetry is equivalent to 
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Figure IV.1.2 


the vacuum. It is only when we look at regions of length scale large compared to 
[k|~ ~1 that we realize that the excitation costs energy. Thus, as tk| > 0, we expect 
the energy of the excitation to vanish. 

We now understand the crucial difference between the N = 1 and the N = 2 
cases: In the former we have a reflection symmetry, which is discrete, while in the 
latter we have an O(2) symmetry, which is continuous. 

We have worked out the N = 2 case in detail. You should now be able to 
generalize our discussion to arbitrary N > 2 (see Exercise (IV.1.1). 

Meanwhile, it is worth looking at N = 2 from another point of view. Many 
field theories can be written in more than one form and it is important to know 
them under different guises. Construct the complex field g = (1/2) (9, +i); 
we have y!y = L(g? + 95) and so can write (2) as 


£ =ag'ag + wo" - ae oy” (6) 


which is manifestly invariant under the U(1) transformation g > e'g (recall 
Chapter I.9). You may recognize that this amounts to saying that the groups O(2) 
and U(1) are locally isomorphic. Just as we can write a vector in Cartesian or 
polar coordinates we are free to parametrize the field by g(x) = p(x)e!? (as 
in Chapter HI.5) so that 8,9 = (0,0 + ipa,,0)e'?. We obtain £ = p*(a6)* + 
(ap)? + up” — Ap*. Spontaneous symmetry breaking means setting p = v + x 
with v = +4/47/2A, whereupon 
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2n 
& = v°(06)? + | Ox)? — 2y?x? — 4 x3 — ax! 
22 2 2 
+ a Xx+x° 1 (6) (7) 


We recognize the phase @(x) as the massless field. We have arranged the terms 
in the Lagrangian in three groups: the kinetic energy of the massless field 6, the 
kinetic and potential energy of the massive field x , and the interaction between 0 
and x. 


Goldstone’s theorem 


We will now prove Goldstone’s theorem, which states that whenever a continuous 
symmetry is spontaneously broken, massless fields, known as Nambu-Goldstone 
bosons, emerge. 

Recall that associated with every continuous symmetry is a conserved charge 
Q. That Q generates a symmetry is stated as 


[H, Q]=0 (8) 


Let the vacuum (or ground state in quantum mechanics) be denoted by |0). By 
adding an appropriate constant to the Hamiltonian H — H +c we can always 
write H |0) = 0. Normally, the vacuum is invariant under the symmetry transfor- 
mation, e!°2 |Q) = |0), or in other words Q |0) = 0. 

But suppose the symmetry is spontaneously broken, so that the vacuum is not 
invariant under the symmetry transformation; in other words, Q |0) 4 0. Consider 
the state Q |0). What is its energy? Well, 


HQ |0) ={H, Q}/0)=0 (9) 


[The first equality follows from H |0) = 0 and the second from (8).] Thus, we have 
found another state Q |0} with the same energy as |0). 
Note that the proof makes no reference to either relativity or fields. You can 
also see that it merely formalizes the picture of the marble rolling along the gutter. 
In quantum field theory, we have local currents, and so 


Q= f a?xI°G,2 


where D denotes the dimension of space and conservation of Q says that the 
integral can be evaluated at any time. Consider the state 


|s) = f axe 9G, t) |0) 
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which has* spatial momentum k. Ask goes to zero it goes over to Q |0), which as 
we learned in (9) has zero energy. Thus, as the momentum of the state |s) goes to 
zero, its energy goes to zero. In a relativistic theory, this means precisely that |s) 
describes a massless particle. 

The proof makes clear that the theorem practically exudes generality: It applies 
to any spontaneously broken continuous symmetry. 


Counting Nambu-Goldstone bosons 


From our proof, we see that the number of Nambu-Goldstone bosons is clearly 
equal to the number of conserved charges that do not leave the vacuum invariant, 
that is, do not annihilate |0). For each such charge Q®, we can construct a zero- 
energy state Q* |0). 

In our example, we have only one current J,, = 1(@9,,92 — 29,91) and hence 
one Nambu-Goldstone boson. In general, if the Lagrangian is left invariant by a 
symmetry group G with n(G) generators, but the vacuum is left invariant by only 
a subgroup H of G with n(#) generators, then there are n(G)— n(H) Nambu- 
Goldstone bosons. If you want to show off your mastery of mathematical jargon 
you can say that the Nambu-Goldstone bosons live on the coset space G/H. 


Ferromagnet and spin wave 


The generality of the proof suggests that the usefulness of Goldstone’s theorem 
is not restricted to particle physics. In fact, it originated in condensed matter 
physics, the classic example there being the ferromagnet. The Hamiltonian, bemg 
composed of just the interaction of nonrelativistic electrons with the ions in the 
solid, is of course invariant under the rotation group S O(3), but the magnetization 
M picks out a direction, and the ferromagnet is left invariant only under the 
subgroup S O (2) consisting of rotations about the axis defined by M. The Nambu- 
Goldstone theorem is easy to visualize physically. Consider a “spin wave” in which 
the local magnetization M (3) varies slowly from point to point. A physicist living 
in a region small compared to the wavelength does not even realize that he or she is 
no longer in the “vacuum.” Thus, the frequency of the wave must go to zero as the 
wavelength goes to infinity. This is of course exactly the same heuristic argument 
given earlier. Note that quantum mechanics is needed only to translate the wave 


2 Acting on it with P’ (Exercise I.10.3) and using P? |0) = 0, we have 
P'|s) = J dxe!*?t pi, P%, 1)] 10) =k |s) 


upon integrating by parts. 
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vector & into momentum and the frequency w into energy. I will come back to 
magnets and spin wave in Chapters V.3 and VI.5. 


Quantum fluctuations and the dimension of spacetime 


Our discussion of spontaneous symmetry breaking is essentially classical. What 
happens when quantum fluctuations are included? I will address this question in 
detail in Chapter [V.3, but for now let us go back to (5). In the ground state, g, = v 
and ~, = 0. Recall that in the mattress model of a scalar field theory the mass term 
comes from the springs holding the mattress to its equilibrium position. The term 
—p’’ (note the prime) in (5) tells us that it costs action for g’ to wander away 
from its ground state value g, = 0. But now we are worried: ¢ is massless. Can it 
wander away from its ground state value? To answer this question let us calculate 
the mean square fluctuation 


1 
((@0)) = = | Doe!S [p,(0)P 


. 1 ; 
= lim = | Dee'® 9) (x)@(0) 
x30 7 
d¢k elkx 
im —_ — 
x30 (27 )4 k? 


(10) 


(We recognized the functional integral that defines the propagator; recall Chapter 
1.7.) The upper limit of the integral in (10) is cut off at some A (which would 
correspond to the inverse of the lattice spacing when applying these ideas to a 
ferromagnet) and so as explained in Chapter III.1 (and as you will see in Chapter 
VIII.3) we are not particularly worried about the ultraviolet divergence for large 
k. But we do have to worry about a possible infrared divergence for small k. (Note 
that for a massive field 1/k* in (10) would have been replaced by 1/(k? + yx”) and 
there would be no infrared divergence.) 

We see that there is no infrared divergence for d > 2. Our picture of spon- 
taneously breaking a continuous symmetry is valid in our (3 + 1)-dimensional 
world. 

However, for d < 2 the mean square fluctuation of gy, comes out infinite, so 
our naive picture is totally off. We have arrived at the Coleman-Mermin- Wagner 
theorem (proved independently by a particle theorist and two condensed matter 
theorists), which states that spontaneous breaking of a continuous symmetry is 
impossible for d = 2. Note that while our discussion is given for O(2) symmetry 
the conclusion applies to any continuous symmetry since the argument depends 
only on the presence of Nambu-Goldstone fields. 

In our examples, symmetry is spontaneously broken by a scalar field g, but 
nothing says that the field g must be elementary. In many condensed matter 
systems, superconductors, for example, symmetries are spontaneously broken, 
but we know that the system consists of electrons and atomic nuclei. The field 
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~ is generated dynamically, for example as a bound state of two electrons in 
superconductors. More on this in Chapter V.4. The spontaneous breaking of a 
symmetry by a dynamically generated field is sometimes referred to as dynamical 
symmetry breaking.” 


Exercises 


IV.1.1. Show explicitly that there are N — 1 Nambu-Goldstone bosons in the G = O(N) 
example (2). 


IV.1.2. Construct the analog of (2) with N complex scalar fields and invariant under 


SU(N). Count the number of Nambu-Goldstone bosons when one of the scalar 
fields acquires a vacuum expectation value. 


This chapter is dedicated to the memory of the late Jorge Swieca. 


Chapter ['V.2 


The Pion as a Nambu-Goldstone Boson 


Crisis for field theory 


After the spectacular triumphs of quantum field theory in the electromagnetic 
interaction, physicists in the 1950s and 1960s were naturally eager to apply it 
to the strong and weak interactions. As we have already seen, field theory when 
applied to the weak interaction appeared not to be renormalizable. As for the 
strong interaction, field theory appeared totally untenable for other reasons. For 
one thing, as the number of experimentally observed hadrons (namely strongly 
interacting particles) proliferated, it became clear that were we to associate a 
field with each hadron the resulting field theory would be quite a mess, with 
numerous arbitrary coupling constants. But even if we were to restrict ourselves to 
nucleons and pions, the known coupling constant of the interaction between pions 
and nucleons is a large number. (Hence the term strong interaction in the first 
place!) The perturbative approach that worked so spectacularly well in quantum 
electrodynamics was doomed to failure. 

Many eminent physicists at the time advocated abandoning quantum field 
theory altogether, and at certain graduate schools, quantum field theory was even 
dropped from the curriculum. It was not until the early 1970s that quantum field 
theory made a triumphant comeback. A field theory for the strong interaction was 
formulated, not in terms of hadrons, but in terms of quarks and gluons. I will get 
to that in Chapter VII.3. 


Pion weak decay 


To understand the crisis facing field theory, let us go back in time and imagine 
what a field theorist might be trying to do in the late 1950s. Since this is not a 
book on particle physics, I will merely sketch the relevant facts. You are urged 
to consult one of the texts on the subject.’ By that time, many semileptonic 
decays such aan > p+e +0,” —~e +), and xn >n +e7 + v had 
been measured. Neutron 6 decay n > p +e” +9 was of course the process 


1 See, e.g., E. Commins and P. H. Bucksbaum, Weak Interactions of Leptons and Quarks. 
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for which Fermi invented his theory, which by that time had assumed the form 
£=G[e— y"*(1— ys)v py, — ys)n], where n is a neutron field annihilating 
a neutron, p a proton field annihilating a proton, v a neutrino field annihilating 
a neutrino (or creating an antineutrino as in 8 decay), and e an electron field 
annihilating an electron. 

It became clear that to write down a field for each hadron and a Lagrangian for 
each decay process, as theorists were in fact doing for a while, was a losing battle. 
Instead, we should write 


with Ji and 5 two currents transforming as a Lorentz vector, and axial vec- 
tor respectively. We think of J,, and Js,, as quantum operators in a canoni- 
cal formulation of field theory. Our task would then be to calculate the ma- 
trix elements between hadron states, (p| (J, — J5,,)|n), (OW, — Js) la), 
(x°| (J,, — J5,,) |}, and so on, corresponding to the three decay processes I 
listed above. (I should make clear that although I am talking about weak decays, 
the calculation of these matrix elements is a problem in the strong interaction. 
In other words, in understanding these decays, we have to treat the strong in- 
teraction to all orders in the strong coupling, but it suffices to treat the weak 
interaction to lowest order in the weak coupling G.) Actually, there is a prece- 
dent for the attitude we are adopting here. To account for nuclear B decay 
(Z, A) > (Z2 +1, A) +e7 +0, Fermi certainly did not write a separate La- 
grangian for each nucleus. Rather, it was the task of the nuclear theorist to calculate 
the matrix element (Z + 1, Al [py,(1— y5)n]|Z, A). Similarly, it is the task of the 
strong interaction theorist to calculate matrix elements such as (p| (J,, — Js,,) |). 

For the story I am telling, let me focus on trying to calculate the matrix element 
of the.axial vector current Js between a neutron and a proton. Here we make a 
trivial change in notation: We no longer indicate that we have a neutron in the 
initial state and a proton in the final state, but instead we specify the momentum 
p of the neutron and the momentum p’ of the proton. Incidentally, in (1) the fields 
and the currents are of course all functions of the spacetime coordinates x. Thus, 
we want to calculate (p’| J;'(x) |p), but by translation invariance this is equal 
to (p'| JH (0) |p)e'"—-)*. Henceforth, we simply calculate (p'| J#(0) |p) and 
suppress the 0. Note that spin labels have already been suppressed. 

Lorentz invariance and parity can take us some distance: They imply that? 


(p'| JE lp) = apy“? F(q@?) + 4" v°G(q*)u(p) (2) 


with g = p’ — p [compare with (III.6.9)]. But Lorentz invariance and parity can 
only take us so far: We know nothing about the “form factors” F (q*) and G(q’). 


* Another possible term of the form (p’ + p)“y°> can be shown to vanish by charge 
conjugation and isospin symmetries. 
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Similarly, for the matrix element ({0| JE \7 —) Lorentz invariance tells us that 
(O| Js |k) = fk¥ (3) 


I have again labeled the initial state by the momentum k of the pion. The right- 
hand side of (3) has to be a vector but since k is the only vector available it has to be 
proportional to k. Just like F (g”) and G(q”), the constant f is a strong interaction 
quantity that we don’t know how to calculate. On the other hand, F (q’), G(q’), 
and f can all be measured experimentally. For instance, the rate for the decay 
m~ —> e~ + Dclearly depends on f?. 


Too many diagrams 


Let us look over the shoulder of a field theorist trying to calculate {p’| Je |p) and 
(0| Js" |k) in (2) in the late 1950s. He would draw Feynman diagrams such as 
the ones in Figures IV.2.1 and IV.2.2 and soon realize that it would be hopeless. 
Because of the strong coupling, he would have to calculate an infinite number of 
diagrams, even if the strong interaction were described by a field theory, a notion 
already rejected by many luminaries of the time. 

In telling the story of the breakthrough I am not going to follow the absolutely 
fascinating history of the subject, full of total confusion and blind alleys. Instead, 
with the benefit of hindsight, I am going to tell the story in what I regard as the 
best pedagogical approach. 


0x4 


(a) (b) 


Figure I'V.2.1 
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Figure IV.2.2 


The pion is very light 


The breakthrough originated in the observation that the mass of the 2~ at 139 
Mev was considerably less than the mass of the proton at 938 Mev. For a long time 
this was simply taken as a fact not in any particular need of an explanation. But 
eventually some theorists wondered why one hadron should be so much lighter 
than another. 

Finally, some theorists took the bold step of imagining an “ideal world” in 
which the z~ is massless. The idea was that this ideal world would be a good 
approximation of our world, to an accuracy of about 15% (~139/938). 

Do you remember one circumstance in which a massless spinless particle would 
emerge naturally? Yes, spontaneous symmetry breaking! In one of the blinding 
insights that have characterized the history of particle physics, some theorists 
proposed that the 7 mesons are the Nambu-Goldstone bosons of some spontaneous 
broken symmetry. 

Indeed, let’s multiply (3) by k,, : 


k,,(O| Jf |k) = fk = fm? (4) 


which is equal to zero in the ideal world. Recall from our earlier discussion on 
translation invariance that 


(0| JA (x) {k) = 0] JE) [ke ** 
and hence 
(0| d,J3 (x) [k) = —ik,, (O| JE(O) Ik)e kx 


Thus, if the axial current is conserved, 9,,J: (x) =0, in the ideal world, 
k,,(0| J’ |k) = 0 and (4) would indeed imply m2 = 0. 

The ideal world we are discussing enjoys a symmetry known as the chiral 
symmetry of the strong interaction. The symmetry is spontaneously broken in the 
ground state we inhabit, with the 7 meson as the Nambu-Goldstone boson. The 
Noether current associated with this symmetry is the conserved J". 

In fact, you should recognize that the manipulation here is closely related to the 
proof of the Nambu-Goldstone theorem given in Chapter IV.1. 
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Goldberger-Treiman relation 


Now comes the punchline. Multiply (2) by (p’ — p) u: By the same translation 
invariance argument we just used, 


(’ — p), (P| JE) |p) = i(p'1 4, JE (x) [pel 


and hence vanishes if 4,,J;° =0. On the other hand, multiplying the right-hand 


side of (2) by (p’ — p),, we obtain a(p')I(p’ — p)y°Fq@*) + a’ y°Gq?)u(p). 
Using the Dirac equation (do it!) we conclude that 


0 = 2my F(q*) + 4°G(q’) (5) 


with my the nucleon mass. 

The form factors F (q”) and G(q”) are each determined by an infinite number 
of Feynman diagrams we have no hope of calculating, but yet we have managed to 
relate them! This represents a common strategy in many areas of physics: When 
faced with various quantities we don’t know how to calculate, we can nevertheless 
try to relate them. 

We can go farther by letting g — 0 in (5). Referring to (2) we see that F'(0) is 
measured experimentally inn — p +e + v (the momentum transfer is negligible 
on the scale of the strong interaction). But oops, we seem to have a problem: We 
predict the nucleon mass my = 0! 

In fact, we are saved by examining Figure IV.2.1b: There are an infinite number 
of diagrams exhibiting a pole due to none other than the massless 7 meson, which 
you can see gives 


fa sennnil(@ 7 UD) ©) 
When the x propagator joins onto the nucleon line, an infinite number of dia- 
grams summed together gives the experimentally measured pion-nucleon coupling 
constant g,.y. Thus, referring to (2), we see that for g ~ 0 the form factor 
G(q*) ~ f(1/a* enn n- Plugging into (5), we obtain the celebrated Goldberger- 
Treiman relation 


2myF(0) + feann =0 (7) 


relating four experimentally measured quantities. As might be expected, it holds 
with about a 15% error, consistent with our not living in a world with an exactly 
massless 7 meson. 


Toward a theory of the strong interaction 


The art of relating infinite sets of Feynman diagrams without calculating them, 
and it is an art form involving a great deal of cleverness, was developed into a 
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subject called dispersion relations and S-matrix theory, which laid the foundation 
on which our present understanding of the strong interaction was built. It may 
be worth mentioning that an important component of dispersion relations was the 
study of the analyticity properties of Feynman diagrams. For example, we show 
in Chapter III.7 that the vacuum polarization diagram has a cut. The essence of 
the Goldberger-Treiman argument is separating the infinite number of diagrams 
into those with a pole in the complex q?-plane and those without a pole (but with 
acut.) 

The discovery that the strong interaction contains a spontaneously broken 
symmetry provided a crucial clue to the underlying theory of the strong interaction 
and ultimately led to the concepts of quarks and gluons. 

A note for the historian of science: Whether theoretical physicists regard a quan- 
tity as small or large depends (obviously) on the cultural and mental framework 
they grew up in. Treiman once told me that the notion of setting 138 Mev to zero, 
when the energy released per nucleon in nuclear fission is of order 10 Mev, struck 
the generation that grew up with the atomic bomb (as Treiman did—he was with 
the armed forces in the Pacific) as surely the height of absurdity. Now of course 
a new generation of young string theorists is perfectly comfortable in regarding 
anything less than the Planck energy 10!? Gev as essentially zero. 


Chapter I'V.3 
Effective Potential 


Quantum fluctuations and symmetry breaking 


The important phenomenon of spontaneous symmetry breaking was based on 
minimizing the classical potential energy V(~) of a quantum field theory. It is 
natural to wonder how quantum fluctuations would change this picture. 

To motivate the discussion, consider once again (III1.3.3) 


£ = 5 (00) — su29? -— Tre! + A@oP + Be? +Co" —) 
(Speaking of quantum fluctuations, we have to include counterterms as indicated.) 
What have you learned about this theory? For 4.2 > 0, the action is extremized 
at g = 0, and quantizing the small fluctuations around g = 0 we obtain scalar 
particles that scatter off each other. For 4* < 0, the action is extremized at some 
Pmin» and the discrete symmetry g — —g is spontaneously broken, as you learned 
in Chapter [V.1. What happens when y. = 0? To break or not to break, that is the 
question. 

A quick guess is that quantum fluctuations would break the symmetry. The 
pt = 0 theory is posed on the edge of symmetry breaking, and quantum fluctuations 
ought to push it over the brink. Think of a classical pencil perfectly balanced on 
its tip and then “turning on” quantum mechanics. 


Wisdom of the son-in-law 


Let us follow Schwinger and Jona-Lasinio and develop the formalism that enables 
us to answer this question. Consider a scalar field theory defined by 


7 = iW) J DoeilS@i+s91 (2) 


[with the convenient shorthand Jg = f d*xJ(x)g(x)]. If we can do the functional 
integral, we obtain the generating functional W(J). As explained in Chapter I.7, 
by differentiating W with respect to the source J (x) repeatedly, we can obtain any 
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Green’s function and hence any scattering amplitude we want. In particular, 


pelx) = oO 3 J Dyell@+4 9 (x) 3) 


The subscript c is used traditionally to remind us (see Appendix 2 in Chapter 1.8) 
that in a canonical formalism g,(x) is the expectation value (0| ¢ |0) of the quantum 
operator @. It is certainly not to be confused with the integration dummy variable 
¢ in (3). To avoid cluttering up the notation, I will drop the subscript c as there is 
really no risk of confusion. 

Given a functional W of J we can perform a Legendre transform to obtain a 
functional I of ¢ (i.e., g,). Legendre transform is just the fancy term for the simple 
relation 


ry) = Wis) — | d*x I(x) ox) (4) 


The relation is simple, but be careful about what it says: It defines a functional of 
w(x) through the implicit dependence of J on g. On the right-hand side of (4) J 
is to be eliminated in favor of g by solving (3). We expand the functional ['(g) in 
the form 


r@) = | d4x[—Veg(y) + Z(Y)(0)? +] 6) 


where (- - -) indicates terms with higher and higher powers of 0. We will soon see 
the wisdom of the notation V.¢¢(¢). 

The point of the Legendre transform is that the functional derivative of I is nice 
and simple: 


OI) Ox) — J 
50(y) 50(y) 5) so) 


= —J(y) (6) 


a relation we can think of as the “dual” of 5W(J)/3J (x) = g(x). 

If you vaguely feel that you have seen this sort of manipulation before in your 
physics eduction, you are quite right! It was in a course on thermodynamics, where 
you learned about the Legendre transform relating the free energy to the energy: 
F = E —TS with F a function of the temperature T and E a function of the 
entropy S. Thus J and are “conjugate” pairs just like T and S (or even more 
clearly magnetic field H and magnetization M). Convince yourself that this is far 
more than a mere coincidence. 

For J and g independent of x we see from (5) that the condition (6) reduces to 


Vor (@) = J (7) 


This relation makes clear what the effective potential V.s-(~) is good for. Let’s ask 
what happens when there is no external source J. The answer is immediate: (7) 
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tells us that 
Vi) = 0, (8) 


In other words, the vacuum expectation value of y in the absence of an external 
source is determined by minimizing Vip(¢). 


First order in quantum fluctuations 


All of these formal manipulations are not worth much if we cannot evaluate W (J). 
In fact, in most cases we can only evaluate eV) = [ Dge'l5(v)+/¢1 in the steepest 
descent approximation (see Chapter I.2). Let us turn the crank and find the steepest 
descent “point” g,(x), namely the solution of 


SS(~) + fdtyJOON | _ 0 6) 
dp(x) %, 
or more explicitly, 
8° y,(x) + W'Ip,(x)] = F(x) (10) 


Write the dummy integration variable in (2) as g = g, + g and expand to quadratic 
order in ¢ to obtain 


Z = ct/DW) — | Dyed /MISW)+91 


~ eli/A1S(0s)+ 5961 J Deel f ax $109°—-V" one 


— el /AIS(os)+J e514 tr logld?+V"@s)] (11) 


We have used (II.5.2) to represent the determinant we get upon integrating over ¢. 
Note that I have put back Planck’s constant f. Here g,, as a solution of (10), is to 
be regarded as a function of J. 

Now that we have determined 


W(J) = [S@5) + Jgs}+ zs tr logld? + V"(g,)}+ Of") 


it is straightforward to Legendre transform. I will go painfully slowly here: 


bw OSS J@,}é 
p= = SEW) F I0I es 4 9, + O(f) = 9, + OF) 


To leading order in A, y (namely the object formerly known as ¢,) is equal to g,. 
Thus, from (4) we obtain 


Pp) = Sty) + 4 tr logla? + V"()] + O@) (12) 


IV.3. Effective Potential 211 


Nice though this formula looks, in practice it is impossible to evaluate the trace 
for arbitrary g(x) : We have to find all the eigenvalues of the operator 67+ V"(y), 
take their log, and sum. Our task simplifies drastically if we are content with 
studying '(g) for g independent of x, in which case V’(g) is a constant and 
the operator 0° + V”(qg) is translation invariant and easily treated in momentum 
space: 


tr log(a? + V"(@)I = | d*x(x| logla? + V"(g)}Ix) 


d‘ 
~ a J aay (x|k) (kl logla? + V"(p)IIk) (kl) 


d*k 
_ 4 __ 72 i 
= [a «| Oni log|—k* + V"(g)] (13) 


Referring to (5), we obtain 


4 2_ yl 
d*k lo * V"(g) 


Onys 2 +O") (14) 


itt 
Ven) = V@) — = | 


known as the Coleman-Weinberg effective potential. What we computed is the 
order # correction to the classical potential V(g). Note that we have added a g 
independent constant to make the argument of the logarithm dimensionless. 

We can give a nice physical interpretation of (14). Let the universe be suffused 
with the scalar field g(x) taking on the value g, a background field so to speak. For 
Vg) = 4u2y? + (1/4) ag*, we have V"(g) = u? + 4Ag* = 2(¢)’, which as the 
notation (vy)? suggests we recognize as the g-dependent effective mass squared 
of a scalar particle propagating in the background field g. The mass squared Va 
in the Lagrangian is corrected by a term 4/7 that is due to the interaction of the 
particle with the background field g. Now we see clearly what (14) tells us: The 
first term V (¢) is the classical energy density contained in the background ¢, while 
the second term is the vacuum energy density of a scalar field with mass squared 
equal to V”’(g) (see (11.5.3) and Exercise IV.3.4). 


Your renormalization theory at work 


The integral in (14) is quadratically divergent, or more correctly, quadratically 
dependent on the cutoff. But no sweat, we were instructed to introduce three 
counterterms (of which only two are relevant here since g is independent of x). 
Thus, we actually have 


Af d+kp ke + V"(g) 
V. = — - 
ey) = V9) + 5 (On)4 l z 


+ Bo’? + Cg* + O(#’) (15) 
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where we have Wick rotated to a Euclidean integral (see Appendix D). Using (D.9) 
and integrating up to kz, = A”, we obtain (suppressing /) 


1 

IV’"@)F , ez A? 
h B Cc 16 
6An? °8 Vig) t gy” +Cy* (16) 


Ver(y) = Vg) + 3552 Av" )- 
As expected, since the integrand in (15) goes as 1/k2, for large k2, the integral 
depends quadratically and logarithmically on the cutoff A’. 
Watch renormalization theory at work! Since V is a quartic polynomial in g, 
V"(g) is a quadratic polynomial and [V” (v)? a quartic polynomial. Thus, we have 
just enough counterterms By* + Cg‘ to absorb the cutoff dependence. This is a 
particularly transparent example of how the method of adding counterterms works. 
To see how bad things can happen in a nonrenormalizable theory, suppose in 
contrast that V is a polynomial of degree 6 in g. Then we are allowed to have 
three counterterms By? + Cy* + De®, but that is not enough since [V"(v)F is 
now a polynomial of degree 8. This means that we should have started out with 
V a polynomial of degree 8, but then [V’(y)} would be a polynomial of degree 
12. Clearly, the process escalates into an infinite-degree polynomial. We see the 
hallmark of a nonrenormalizable theory: its insatiable appetite for counterterms. 


Imposing renormalization conditions 


Waking up from the nightmare of an infinite number of counterterms chasing us, 
let us go back to the sweetly renormalizable g* theory. In Chapter III.3 we fix the 
counterterms by imposing conditions on various scattering amplitudes. Here we 
would have to fix the coefficients B and C by imposing two conditions on V.¢(¢) 
at appropriate values of gy. We are working in field space, so to speak, rather than 
momentum space, but the conceptual framework is the same. 

We could proceed with the general quartic polynomial V (@), but instead let us 
try to answer the motivating question of this chapter: What happens when yz = 0, 
that is, when V (vy) = (1/4!)Ag*? The arithmetic is also simpler. 

Evaluating (16) we get 


22 


(én? log 7 - 4 C)y* + 0107) 


Ven) = (5 a+ By? +(Dh+ 


(after absorbing some g-independent constants into C). We see explicitly that the 
A dependence can be absorbed into B and C. 

We started out with a purely quartic V(g). Quantum fluctuations generate a 
quadratically divergent y* term that we can cancel with the B counterterm. What 
does 4 = 0 mean? It means that (d*V/dg*)|,-9 vanishes. To say that we have 
a 24 =0 theory means that we have to maintain a vanishing renormalized mass 
squared, defined here as the coefficient of gy. Thus, we impose our first condition 
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d* Var 


id ~0 (17) 


g=0 


This is a somewhat long-winded way of saying that we want B = —(A?/6417)A 
to this order. 

Similarly, we might think that the second condition would be to set 
(d*V¢/dg*)|o—0 equal to some coupling, but differentiating the y* log g term 
in Vee four times we are going to get a term like log gy, which is not defined at 
g = 0. We are forced to impose our condition on d*V,/dg* not at g = 0 but at 
g equal to some arbitrarily chosen mass M. (Recall that g has the dimension of 
mass.) Thus, the second condition reads 


d* Ve 


ies =1(M) (18) 


g=M 
where 4.(M) is a coupling manifestly dependent on M. 
Plugging 


Vee(g) = (LA n° 
fe A 6x2 


2 
Y 4 3 


into (18) we see that A(M) is equal to A plus O(A”) corrections, among which is 
a term like A? log M. We can get a clean relation by differentiating 1(M/) : 


dM) 3 2 3 
aM i6n2” + OW) 
~ 3 yey)? 3 
= eqn) + Of[A(M)’} (19) 


where the second equality is correct to the order indicated. This interesting relation 
tells us how the coupling A.(M) depends on the mass scale M at which it is defined. 
Recall Exercise III.1.3. We will come back to this relation in Chapter VI.7 on the 
renormalization group. 

Meanwhile, let us press on. Using (18) to determine C and plugging it into Vere 
we obtain 


ACM)? ‘( gy -3) 3 
Cem” aps + OfA(M)*] (20) 


1 
Ver (9) = MMe" + 
You are no longer surprised, I suppose, that C and the cutoff A have both disap- 
peared. That’s a renormalizable theory for you! 
The fact that V.¢¢ does not depend on the arbitrarily chosen M, namely, 
M (dV .¢/dM) = 0, reproduces (19) to the order indicated. 
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Breaking by quantum fluctuations 


Now we can answer the motivating question: To break or not to break? 

Quantum fluctuations generate a correction to the potential of the form 
+4 log y?, but log gy? is whopping big and negative for small gv! The O(h) 
correction overwhelms the classical O(#°) potential +g* near gv = 0. Quantum 
fluctuations break the discrete symmetry g > —¢. 

It is easy enough to determine the minima +¢,,;, Of Vag (gy) (which you should 
plot as a function of ¢ to get a feeling for). But closer inspection shows us that we 
cannot take the precise value of ¢y;, seriously; V.¢ has the form Ag*(1+ A log g + 
---) suggesting that the expansion parameter is actually A log g. [Try to convince 
yourself that (- - -) starts with (A log v)*.] The minima g,;, of Ves clearly occurs 
when the expansion parameter is of order unity. In an exercise in Chapter IV.7 you 
will see a clever way of getting around this problem. 


Fermions 


In (11) g, plays the role of an external field while ¢ corresponds to a quantum field 
we integrate over. The role of g can also be played by a fermion field y. Consider 
adding ¥ (ig — m — fy)y to the Lagrangian. In the path integral 


Z = | DoD Dye! | F214 0eP -VO+bEI-m— For (21) 
we can always choose to integrate over 7 first, obtaining 
Z= | Dee! f a4x14(8¢)?—V (gy) Htr log(ig—m— f¢) (22) 


Repeating the steps in (13) we find that the fermion field contributes 


p—m— fo 
a i 23 
( p ”) 


to V.g(g). (The trace in (23) is taken over the gamma matrices.) Again from 
Chapter II.5, we see that physically V-(¢) represents the vacuum energy of a 
fermion with the effective mass m(y) =m + fq. 

We can massage the trace of the logarithm using tr log M = log det M (IL5.12) 
and cyclically permuting factors in a determinant): 


d*p 
V = +i tr] 
F@) +i f ons og 


tr log(p — a) = tr log y°(p — a)y? = tr log(— p — a) 
= 5 tr(log( p — a) + log( p + a)) + 4 tr log(—1) 


= } trlog(—1)(p* — a’). (24) 
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Hence, 
_ 2_ 42 2_ 42 
trlog P= 2 —  triog 2 =~ = 2log 2 —* (25) 
and so 
_[ dp, p?-m@) 
V =2 —— log —____—"— 26 
F(~) if (ny og P (26) 


Contrast the overall sign with the sign in (14): the difference in sign between 
fermionic and bosonic loops was explained in Chapter IT.5. 

Thus, in the end the effective potential generated by the quantum fluctuations 
has a pleasing interpretation: It is just the energy density due to the fluctuating 
energy, entirely analogous to the zero point energy of the harmonic oscillator, of 
quantum fields living in the background ¢ (see Exercise IV.3.5). 


Exercises 


IV.3.1. Consider the effective potential in (0 + 1)-dimensional spacetime: 


hf dk kL 4+ VV") 
Veg(g) = Vv) + = £ log 4 __—— 


) On 
25 Qn) ki. FOG) 


No counterterm is needed since the integral is perfectly convergent. But 
(0 + 1)-dimensional field theory is just quantum mechanics. Evaluate the inte- 
gral and show that V,5- is in complete accordance with your knowledge of quantum 
mechanics. 


IV.3.2. Study Veg in (1+ 1)—dimensional spacetime. 


IV.3.3. Consider a massless fermion field % coupled to a scalar field y by foyy in 
(1+ 1)-dimensional spacetime. Show that 


Vp = (fF 9)? lo a (27) 
Pon IPP 8 yp 


after a suitable counterterm has been added. This result is important in condensed 
matter physics, as we will see in Chapter V.5 on the Peierls instability. 


IV.3.4. Understand (14) using Feynman diagrams. Show that V.g is generated by an 
infinite number of diagrams. [Hint: Expand the logarithm in (14) as a series in 
V"(g)/k* and try to associate a Feynman diagram with each term in the series. ] 


IV.3.5. Consider the electrodynamics of a complex scalar field 
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£ =] FF +[@" + ied)" [(0, — ied, 9] 


. 
+ 29' 9 —My' yy (28) 


Ina universe suffused with the scalar field g(x) taking on the value g independent 
of x as in the text, the Lagrangian will contain a term (e*y'p)A,,A“ so that the 
effective mass squared of the photon field becomes M(g)* = e?g'g. Show that 
its contribution to V,(g) has the form 


d*k lo k* — M(o)? 
(22)4 k? 


(29) 


Compare with (14) and (26). [Hint: Use the Landau gauge to simplify the calcu- 
lation.] If you need help, I strongly urge you to read S. Coleman and E. Weinberg, 
Phys. Rev. D7: 1883, 1973, a paragon of clarity In exposition. 


Chapter IV.4 
Magnetic Monopole 


Quantum mechanics and magnetic monopoles 


Curiously enough, while electric charges are commonplace nobody has ever seen 
a magnetic charge or monopole. Within classical physics we can perfectly well 
modify one of Maxwell’s equations to V-B= Py, with px denoting the density 
of magnetic monopoles. The only price we have to pay is that the magnetic field B 
can no longer be represented as B=V x Asince otherwise V-B=V-V x A= 

€;;,9;0; A, = 0 identically. Newton and Leibniz told us that derivatives commute 
with each other. 

So what, you say. Indeed, who cares that B cannot be written as V x A? The 
vector potential A was introduced into physics only as a mathematical crutch, 
and indeed that is still how students are often taught in a course on classical 
electromagnetism. As the distinguished nineteenth-century physicist Heaviside 
thundered, “Physics should be purged of such rubbish as the scalar and vector 
potentials; only the fields E and B are physical.” 

With the advent of quantum mechanics, however, Heaviside was proved to be 
quite wrong. Recall, for example, the nonrelativistic Schrodinger equation for a 
charged particle in an electromagnetic field: 


ne —ieA)? + 6] y=Ep (1) 
2m 


Charged particles couple directly to the vector and scalar potentials Aand @, which 
are thus seen as being more fundamental, in some sense, than the electromagnetic 
fields E and B, as I alluded to in Chapter III.4. Quantum physics demands the 
vector potential. 

Dirac noted brilliantly that these remarks imply an intrinsic conflict between 
quantum mechanics and the concept of magnetic monopoles. Upon closer analy- 
sis, he found that quantum mechanics does not actually forbid the existence of 
magnetic monopoles. It allows magnetic monopoles, but only those carrying a 
specific amount of magnetic charge. 
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Differential forms 


For the following discussion and for the next chapter on Yang-Mills theory, it is 
highly convenient to use the language of differential forms. Fear not, we will need 
only a few elementary concepts. Let x” be D real variables (thus, the index yu 
takes on D values) and A,, (not necessarily the electromagnetic gauge potential in 
this purely mathematical section) be D functions of the x’s. In our applications, x 
represent coordinates and, as we will see, differential forms have natural geometric 
interpretations. 

We call the object A = A,,dx" a 1-form. The differentials dx” are treated 
following Newton and Leibniz. If we change coordinates x — x’, then as usual 
dx" = (ax"/dx'")dx” so that A= A,dx" = A, (0x"/dx”)dx” = A\dx'”. This 
reproduces the standard transformation law of vectors under coordinate trans- 
formation A’, = A,,(8x"/dx""). As an example, consider A = cos @ dg. Regard- 
ing @ and g as angular coordinates on a 2-sphere (namely the surface of a 3- 
ball), we have Ag =0 and A, =cos @. Similarly, we define a p-form as H = 
/P) Ay yyg---u,dx"'dx ---dx"r, (Repeated indices are summed, as always.) 
The “degenerate” example is that of a 0-form, cali it A, which is just a scalar 
function of the coordinates x”. An example of a 2-formis F = (1/2!) F,,dx"dx”. 

We now face the question of how to think about products of differentials. 
In an elementary course on calculus we learned that dx dy represents the area 
of an infinitesimal rectangle with length dx and width dy. At that level, we 
more or less automatically regard dy dx as the same as dx dy. The order of 
writing the differentials does not matter. However, think about making a coordinate 
transformation so that x = x(x’, y’) and y = y(x’, y’) are now functions of the new 
coordinates x’ and y’. Now look at 


ax ox oy oy 
dx dy = | —dx' + —d ‘ (24 ‘4 —« ‘ 2 
vay (Fe ax! + ay ax’ * + ay @) 


Note that the coefficient of dx’dy’ is (0x /0x’)(8y/dy’) and that the coefficient of 
dy'dx' is (8x/dy')(dy/dx'). We see that it is much better if we regard the dif- 
ferentials dx" as anticommuting objects [what mathematicians would call Grass- 
mann variables (recall Chapter (II.5)] so that dy’dx' = —dx’dy’ and dx'dx’ =0= 
dy'dy'. Then (2) simplifies neatly to 


ox a ax a 
dx dy = (so — wae) dx'dy'’ = J (x, y,x', y)dx'dy’ (3) 


We obtain the correct Jacobian J(x, y; x’, y’) for transforming the area element 
dx dy to the area element dx’dy’. 

In many texts, dx dy is written as dx“dy. We will omit the wedge—no reason 
to clutter up the page. 

This little exercise tells us that we should define dx"dx” = —dx’dx" and 
regard the area element dx“dx” as directional. The area elements dx"dx” and 
dx’dx" have the same magnitude but point in opposite directions. 
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We now define a differential operation d to act on any form. Acting on a p-form 
Hi, it gives by definition 


1 


dH = pie ana upd axe idx ---dxte 


Thus, dA = a, Adx” and 
dA =0,A,dx"dx" = 3(d,A, — 3, Ay)dx"dx" 


We see that this mathematical formalism is almost tailor made to describe 
electromagnetism. If we call A = A,,dx" the potential 1-form and think of A,, 
as the electromagnetic potential, then F = dA is in fact the field 2-form. If we 
write F out in terms of its components F = (1/2!) F,,,dx"dx", then F,,, is indeed 
the electromagnetic field. 

Note that x" is not a form, and dx is not d acting on a form. 

If you like, you can think of differential forms as “merely” an elegantly compact 
notation. The point is to think of physical objects such as A and F as entities, 
without having to commit to any particular coordinate system. This is particularly 
convenient when one has to deal with objects more complicated than A and F, for 
example in string theory. By using differential forms, we avoid drowning in a sea 
of indices. 

An important identity is 


dd =0 (4) 


which says that acting with d on any form twice gives zero. Verify this as an 
exercise. In particular dF = dd A = 0. If you write this out in components you will 
recognize it as a standard identity (the “Bianchi identity”) in electromagnetism. 


Closed is not necessarily globally exact 


It is convenient here to introduce some jargon. A p-form a is said to be closed if 
da = (). It is said to be exact if there exists a (p — 1)-form 8 such that a = dB. 
Talking the talk, we say that (4) tells us that exact forms are closed. 
Is the converse of (4) true? Kind of. The Poincaré lemma states that a closed 
form is locally exact. In other words, if dH = 0 with H some p-form, then locally 


H=dK (5) 


for some (p — 1)-form K. However, it may or may not be the case that H = dK 
globally, that is, everywhere. Actually, whether you know it or not, you are already 
familiar with the Poincaré lemma. For example, surely you learned somewhere that 
if the curl of a vector field vanishes, the vector field is locally the gradient of some 
scalar field. 

Forms are ready made to be integrated over. For example, given the 2-form 
F = (1/2) F,,,dx"dx", we can write tu F for any 2-manifold M. Note that the 
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measure is already included and there is no need to specify a coordinate choice. 
Again, whether you know it or not, you are already familiar with the important 
theorem 


| dH = H (6) 
M aM 


with H a p-form and 0M the boundary of a (p + 1)-dimensionai manifold M. 


Dirac quantization of magnetic charge 


After this dose of mathematics, we are ready to do some physics. Consider a sphere 
surrounding a magnetic monopole with magnetic charge g. Then the electromag- 
netic field 2-form is given by F = (g/42)dcos 6 dg. This is almost a definition 
of what we mean by a magnetic monopole (see Exercise IV.4.3.) In particular, 
calculate the magnetic flux by integrating F over the sphere S* 


[irre (7) 


As I have already noted, the area element is automatically included. Indeed, you 
might have recognized dcos 6 dy = — sin 6 d@ d¢ as precisely the area element 
on a unit sphere. Note that in “ordinary notation” (7) implies the magnetic field 
B = (g/4nr?)r, with? the unit vector in the radial direction. 

I will now give a rather mathematical, but rigorous, derivation, originally 
developed by Wu and Yang, of Dirac’s quantization of the magnetic charge g. 

First, let us recall how gauge invariance works, from, for example, (II.7.3). Un- 
der a transformation of the electron field y (x) > e'4@ w(x), the electromagnetic 
gauge potential changes by 


1 _, , 
A, (x) _ Au (x) + oo Maye NO 
or in the language of forms, 
1 ¥,..; 
A> A+—eAdeiA (8) 
ie 
Differentiating, we can of course write 
1 
Ay) > Ay) + 5 eA) 
as is commonly done. The form given here reminds us that gauge transformation 
is defined as multiplication by a phase factor e'A@) so that A(x) and A(x) + 27 
describe exactly the same transformation. 
In quantum mechanics A is physical, pace Heaviside, and so we should ask 


what A would give rise to F = (g/4:) d cos 0 dg. Easy, you say; clearly A = 
(g/42r) cos 6 dg. (In checking this by calculating dA, remember that dd = 0.) 
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But not so fast; your mathematician friend says that dg is not defined at the north 
and south poles. Put his objection into everyday language: If you are standing on 
the north pole, what is your longitude? So strictly speaking it is forbidden to write 
A =(g/41) cos 6 dg. 

But, you are smart enough to counter, then what about Ay = (g/47r)(cos @ — 1) 
dg, eh? When you act with d on Ay you obtain the desired F’; the added piece 
(g/42)(—) dg gets annihilated by d thanks once again to the identity (4). 

OK, but your mathematician friend points out that your Ay is not defined at the 
south pole. 

Right, you respond, I anticipated that by adding the subscript N. Iam now also 
forced to define As = (g/42r)(cos @ + 1) dg. Note that d acting on Ax again gives 
the desired F'. But now As is defined everywhere except at the north pole. 

In mathematical jargon, we say that the gauge potential A is defined locally, but 
not globally. The gauge potential A, is defined on a “coordinate patch” covering 
the northern hemisphere and extending past the equator as far south as we want as 
long as we do not include the south pole. Similarly, A> is defined on a “coordinate 
patch” covering the southern hemisphere and extending past the equator as far 
north as we want as long as we do not include the north pole. 

But what happens where the two coordinate patches overlap, for example, along 
the equator. The gauge potentials A, and As are not the same: 


As — Ay =2—dg (9) 
4x 


Now what? Aha, but this is a gauge theory: If As and Ay are related by a 
gauge transformation, then all is well. Thus, referring to (8) we require that 
2(g/42) dy = (1/ie)e‘Sde'“ for some phase function e'“. By inspection we 
have eA _— ei2les/4m)9 | 

But g = Oand g = 27 describe exactly the same point. In order for e’“ to make 
sense, we must have e!2(¢8/47) 27) — ¢i2(eg/4x)0) — 1; in other words, e'€ = 1, or 


g= an (10) 
e 


where n denotes an integer. This is Dirac’s famous discovery that the magnetic 
charge on a magnetic monopole is quantized in units of 27/e. A “dual” way of 
putting this is that if the monopole exists then.electric charge is quantized in units 
of 27 /g. 

Note that the whole point is that F is locally but not globally exact; otherwise 
by (6) the magnetic charge g = f,. F would be zero. 

I show you this rigorous mathematical derivation partly to cut through a lot of 
the confusion typical of the derivations in elementary texts and partly because this 
type of argument is used repeatedly in more advanced areas of physics, such as 
string theory. 
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Electromagnetic duality 


That a duality may exist between electric and magnetic fields has tantalized theo- 
retical physicists for a century and a half. By the way, if you read Maxwell, you will 
discover that he often talked about magnetic charges. You can check that Maxwell’ S 
equations are invariant under the elegant transformation (E +iB y—> elf (E +i B ) 
if magnetic charges exist. 

One intriguing feature of (10) is that if e is small, then g is large, and vice 
versa. What would magnetic charges look like if they exist? They wouldn’t look 
any different from electric charges: They too interact with a 1/r potential, with likes 
repelling and opposites attracting. In principle, we could have perfectly formulated 
electromagnetism in terms of magnetic charges, with magnetic and electric fields 
exchanging their roles, but the theory would be strongly coupled, with the coupling 
g rather than e. 

Theoretical physicists are interested in duality because it allows them a glimpse 
into field theories in the strongly coupled regime. Under duality, a weakly coupled 
field theory is mapped into a strongly coupled field theory. This is exactly the 
reason why the discovery several years ago that certain string theories are dual 
to others caused such enormous excitement in the string theory community: We 
get to know how string theories behave in the strongly coupled regime. More on 
duality in Chapter VI.3. 


Forms and geometry 


The geometric character of differential forms is further clarified by thinking about 
the electromagnetic current of a charged particle tracing out the world line X"(r) 
in D-dimensional spacetime (see Figure IV.4.1a): 


J¥(x) = f a3 — X(t)] (11) 
T 


The interpretation of this elementary formula from electromagnetism is clear: 
dX" /dt is the 4-velocity at a given value of the parameter t (“proper time”) and 
the delta function ensures that the current at x vanishes unless the particle passes 
through x. Note that J“(x) is invariant under the reparametrization t > t’(r). 

The generalization to an extended object is more or less obvious. Consider a 
string. It traces out a world sheet X“(r, o) in spacetime (see Figure 1b), where o 
is a parameter telling us where we are along the length of the string. [For example, 
for a closed string, o is conventionally taken to range between 0 and 27 with 
X"#(r, 0) = X(t, 27).] The current associated with the string is evidently given 
by 


iW v 
J’’(x) = | drdo det hal oe) oOte—X,o)1 12) 
a,X" a,X 
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x" (1) xH(g,1) 


(a) (b) 


Figure IV.4.1 


where 0, = 0/dt and so forth. The determinant is forced on us by the requirement 
of invariance under reparametrization t > t'(t,0),0 > o’(t, a). It follows that 
J”” is an antisymmetric tensor. Hence, the analog of the electromagnetic potential 
A,, coupling to the current J is an antisymmetric tensor field B,,,, coupling to the 
current J“”. Thus, string theory contains a 2-form potential B = 5 B,,,dx"dx” and 
the corresponding 3-form field H = dB. In fact, string theory typically contains 
numerous p-forms. 


Aharonov-Bohm effect 


The reality of the gauge potential A was brought home forcefully in 1959 by 
Aharonov and Bohm. Consider a magnetic field B confined to a region Q as 
illustrated in Figure IV.4.2. The quantum physics of an electron is described by 
solving the Schrodinger equation (1). In Feynman’s path integral formalism the 
amplitude associated with a path P is modified by a multiplicative factor e’® Sp Ade 
where the line integral is evaluated along the path P. Thus, in the path integral 
calculation of the probability for an electron to propagate from a to b (Fig. IV.4.2), 
there will be interference between the contributions from path 1 and path 2 of the 
form 


Gi Sp, “ey (e' Sr, say _ (clef Aa) 


but § A-d¥ = [ B-dS is precisely the flux enclosed by the closed curve 
(P; — Pz). Remarkably, the electron feels the effect of the magnetic field even 
though it never wanders into a region with a magnetic field present. 
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VA 


2 B=0 


Figure IV.4.2 


When the Aharonov-Bohm paper was first published, no less an authority 
than Niels Bohr was deeply disturbed. The effect has since been conclusively 
demonstrated in a series of beautiful experiments by Tonomura and collaborators. 

Coleman once told of a gedanken prank that connects the Aharonov-Bohm 
effect to Dirac quantization of magnetic charge. Let us fabricate an extremely thin 
solenoid so that itis essentially invisible and thread it into the lab of an unsuspecting 
experimentalist, perhaps our friend from Chapter III.1. We turn on a current and 
generate a magnetic field through the solenoid. When the experimentalist suddenly 
sees the magnetic flux coming out of apparently nowhere, she gets so excited that 
she starts planning to go to Stockholm. 

What is the condition that prevents the experimentalist from discovering the 
prank? A careful experimentalist might start scattering electrons around to see if 
she can detect a solenoid. The condition that she does not see an Aharonov-Bohm 
effect and thus does not discover the prank is precisely that the flux going through 
the solenoid is an integer times 27/e. This implies that the apparent magnetic 
monopole has precisely the magnetic charge predicted by Dirac! 


Exercises 


IV.4.1. Prove dd = 0. 


IV.4.2. Show by writing out the components explicitly that dF = 0 expresses something 
that you are familiar with but disguised in a compact notation. 


IV.4.3. Consider F = (g/47) d cos @ dg. By transforming to Cartesian coordinates show 
that this describes a magnetic field pointing outward along the radial direction. 


IV.4.4. Restore the factors of # and c in Dirac’s quantization condition. 
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IV.4.5. Write down the reparametrization-invariant current J“”* of a membrane. 


IV.4.6. Let g(x) be the element of a group G. The 1-form v = gdg’ is known as the 
Cartan-Maurer form. Then tr v’’ is trivially closed on an N-dimensional manifold 
since itis already an N-form. Consider Q = fy tr v% with S% the N-dimensional 
sphere. Discuss the topological meaning of Q. These considerations will become 
important later when we discuss topology in field theory in Chapter V.7. [Hint: 
Study the case N = 3 and G = SU (2).] 


Chapter I'V.5 
Nonabelian Gauge Theory 


Most such ideas are eventually discarded or shelved. But some persist and 
may become obsessions. Occasionally an obsession does finally turn out 
to be something good. 
—C.N. Yang talking about an idea that he first had as a 
student and that he kept coming back to year after year.! 


Local transformation 


It was quite a nice little idea. 

To explain the idea Yang was talking about, recall our discussion of symmetry 
in Chapter I.9. For the sake of definiteness let p(x) = {g) (x), g2(X), ---, @n(x)} 
be an N-component complex scalar field transforming as g(x) > Ug(x), with U 
an element of SU(N). Since g' > y*U* and U'U = 1, we have y'g — g' and 
aptay — ay‘ dg. The invariance of the Lagrangian & —3y'ay — V(o'¢) under 
SU(N) is obvious for any polynomial V. 

In the theoretical physics community there are many more people who can 
answer well-posed questions than there are people who can pose the truly important 
questions. The latter type of physicist can invariably also do much of what the 
former type can do, but the reverse is certainly not true. 

In 1954 C.N. Yang and R. Mills asked what will happen if the transformation 
varies from place to place in spacetime, or in other words, if U = U (x) is a function 
of x. 

Clearly, y'g is still invariant. But in contrast dg'@g is no longer invariant. 
Indeed, 


8.9 > 9,,(Ug) = Ua, + (4,U)9 = ULA,g + (U4, U)¢] 


To cancel the unwanted term (U fa, U), we generalize the ordinary derivative 4, 
to a covariant derivative D,,, which when acting on ¢, gives 


Dj, Q(X) = O,9(*) — 1Ay (xX) 9) (1) 


'C.N. Yang, Selected Papers 1945-1980 with Commentary, p. 19. 
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The field A,, is called a gauge potential in direct analogy with electromagnetism. 
How must A,, transform, so that D u(x) > U(x) D,, p(x)? In other words, we 
would like D,,g(x) to transform the way u?(x) transformed when U did not 
depend on x. If so, then [D,,9(«)]' D,, g(x) — [D,,g(«)]'D,, (x) and can be used 
as an invariant kinetic energy term for the field g. 
Working backward, we see that D,, p(x) > U(x)D uP (x) if (and it goes without 
saying that you should be checking this) 


A, -> UA,,Ut — i(8,U)U! =UA,U' +iUa,Ut (2) 


(The equality follows from UU? = 1.) We refer to A » as the nonabelian gauge 
potential and to (2) as a nonabelian gauge transformation. 
Let us now make a series of simple observations. 


1. Clearly, A, have to be N by N matrices. Work out the transformation law 
for Al using (2) and show that the condition A in Al = 0 is preserved by 
the gauge transformation. Thus, it is consistent to take A,, to be hermitean. 

. Specifically, you should work out what this means for the group SU(2) so 
that U = e'?*/? where 6 - t = 6%r, with r“ the familiar Pauli matrices. 

2. Writing U =e’? with T? the generators of SU(N), we have 


Ay > A, +i0(T?, Ay J+ 9,0°T? (3) 


under an infinitesimal transformation U ~ 1+ i. 7. For most purposes, 

the infinitesimal form (3) suffices. 

3. Taking the trace of (3) we see that the trace of A,, does not transform and 
so we can take A,, to be traceless as well as hermitean. This means that we 
can always write A,, = A’ T“ and thus decompose the matrix field A,, into 
component fields Al There are as many A®’s as there are generators in the 
group [3 for SU (2), 8 for SU (3), and so forth.] 

4. You are reminded in Appendix B that the Lie algebra of the group is de- 
fined by (7%, T°] =if@°T°, where the numbers f%”¢ are called structure 
constants. For example, f° = ¢%¢ for SU (2). Thus, (3) can be written as 

At > Aa — FPO AS + 3,04 (4) 
Note that if @ does not depend on x, the Ais transform as the adjoint 
representation of the group. 

5. If U(x) =e! is just an element of the abelian group U(1), all these 
expressions simplify and A_, is just the abelian gauge potential familiar from 
electromagnetism, with (2 ) the usual abelian gauge transformation. Hence, 
A,, is known as the nonabelian gauge potential. 

A transformation U that depends on the spacetime coordinates x is known as 

a gauge transformation or local transformation. A Lagrangian & invariant under a 

gauge transformation is said to be gauge invariant. 
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Construction of the field strength 
We can now immediately write a gauge invariant Lagrangian, namely 


£ =(D,9)'(D,9) — V¢"'g) (5) 


but the gauge potential A,, does not yet have dynamics of its own. In the familiar 
example of U(1) gauge invariance, we have written the coupling of the electro- 
magnetic potential A,, to the matter field g, but we have yet to write the Maxwell 
term —;F,,,F"* in the Lagrangian. Our first task is to construct a field strength 
F,,, out of A,,. How do we do that? Yang and Mills apparently did it by trial and 
error. As an exercise you might also want to try that before reading on. 

At this point the language of differential forms introduced in Chapter IV.4 
proves to be of use. It is convenient to absorb a factor of —i by defining Au = 
—iA", where A? denotes the gauge potential we have been using all along. Until 
further notice, when we write A,, we mean AN . Referring to (1) we see that 
the covariant derivative has the cleaner form Dy = Ou +A us (Incidentally, the 
superscripts M and P indicate the potential appearing in the mathematical and 
physical literature, respectively.) As before, let us introduce A = A,,dx", now 
a matrix 1-form, that is, a form that also happens to be a matrix in the defining 
representation of the Lie algebra [e.g., an N by N traceless hermitean matrix for 
SU (N).] Note that 


A? =A, A,dx"dx" = s[A,, Ay]dx"dx” 
is. not zero for a nonabelian gauge potential. (Obviously, there is no such object in 
electromagnetism.) 
Our task is to construct a 2-form F = }F,,,dx"dx” out of the 1-form A. We 
adopt a direct approach. Out of A we can construct only two possible 2-forms: dA 


and A”. So F must be a linear combination of the two. 
In the notation we are using the transformation law (2) reads 


A UAU' +UaU' (6) 
with U a 0-form (and so dU = 0,U "dx" ) Applying d to (6) we have 
dA» UdAU'+dUAU' —UAdU' + dUdU" (7) 


Note the minus sign in the third term, from moving the 1-form d past the 1-form 
A. On the other hand, squaring (6) we have 


A? > UA2U' + UAdU' +UdU UAU' +UdU'UdU" (8) 
Applying d to UU’ = 1 we have UdU' = —dUU  . Thus, we can rewrite (8) as 


A? > UA2U' + UAdU' — dU AU — dUdU' (9) 
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Lo and behold! If we add (7) and (9), six terms knock each other off, leaving 
us with something nice and clean: 


dA+A*—->U(dA+A2)U (10) 
The mathematical structure thus led Yang and Mills to define the field strength 
F=dA+ A? (11) 

Unlike A, the field strength 2-form F transforms homogeneously (10): 


F—>UFU (12) 


In the abelian case A? vanishes and F reduces to the usual electromagnetic form. 
In the nonabelian case, F is not gauge invariant, but gauge covariant. 

Of course, you can also construct F? without using differential forms. As an 
exercise you should do it starting with (4). The exercise will make you appreciate 
differential forms! At the very least, we can regard differential forms as an elegantly 
compact notation that suppresses the indices a and yz in (4). At the same time, 
the fact that (11) emerges so smoothly clearly indicates a profound underlying 
mathematical structure. Indeed, there is a one-to-one translation between the 
physicist’s language of gauge theory and the mathematician’s language of fiber 
bundles. 

Let me show you another route to (11). In analogy to d, define D=d + A, 
understood as an operator acting on a form to its right. Let us calculate 


D’=(d+A)\d+A)=d+4+dA+Ad+ A? 


The first term vanishes, the second can be written as dA = (dA) — Ad; the paren- 
thesis emphasizes that d acts only on A. Thus, 


D?=(dA)+A7=F (13) 


Pretty slick? I leave it as an exercise for you to show that D? transforms homoge- 
neously and hence so does F. 

Elegant though differential forms are, in physics it is often desirable to write 
more explicit formulas. We can write (11) out as 


F = (0,A,+ A, A,)dx"dx" = 3(0,A, — 8Ay + [A,,, Aydx"dx” (14) 


With the definition F = 1F.,,dx“dx" we have 


= iF yy 
Fy =9,Ay — OA, +A), Ay] (15) 
At this point, we might also want to switch back to physicist’s notation. Recall 


that A,, in (15) is actually Au = —iA/, and so by analogy define FY =—iF/. 
Thus, 


Fy, =9,A, — A, —i[A,, Ay] (16) 
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where, until further notice, A,, stands for Al . (One way to see the necessity for 
the i in (16) is to remember that physicists like to take A, to be a hermitean matrix 
and the commutator of two hermitean matrices is antihermitean.) 

As long as we are being explicit we might as well go all the way and exhibit the 
group indices as well as the Lorentz indices. We already wrote A,, = Aj, T® and 
so we naturally write F,,,, = F, jive Then (16) becomes 


Fa, = 8,A5 — a,Aa + fV°Ar AC (17) 


I mention in passing that for SU (2) A and F transform as vectors and the structure 
constant f?"° is just e%"°, so the vector notation Fy = 0, A, —d, A, + A, x A, 
is often used. 


The Yang-Mills Lagrangian 


Given that F transforms homogeneously (12) we can immediately write down the 
analog of the Maxwell Lagrangian, namely the Yang-Mills Lagrangian 


1 
b=-750 F FY (18) 


We are normalizing T? by tr T¢T° = 167° so that & = —(1/4g7) Fa Fae, The 
theory described by this Lagrangian is known as pure Yang-Mills theory or non- 
abelian gauge theory. 

Apart from the quadratic term (0,,A) ~— 4 Ay’ , the Lagrangian £ = 
—(1/48°) Fry F#” also contains a cubic term f7°A?# A°"(a,,A% — 2 yA’) and 
a quartic term (f ape ab Ac “y. As in electromagnetism the quadratic | term describes 
the propagation of a massless vector boson carrying an internal index a, known 
as the nonabelian gauge boson or the Yang-Mills boson. The cubic and quartic 
terms are not present in electromagnetism and describe the self-interaction of the 
nonabelian gauge boson. The corresponding Feynman rules are given in Figure 
IV.5.1a, 1b, and c. 

The physics behind this self-interaction of the Yang-Mills bosons is not hard 
to understand. The photon couples to charged fields but is not charged itself. 
Just as the charge of a field tells us how the field transforms under the U (1) 
gauge group, the analog of the charge of a field in a nonabelian gauge theory 
is the representation the field belongs to. The Yang-Mills bosons couple to all 
fields transforming nontrivially under the gauge group. But the Yang-Mills bosons 
themselves transform nontrivially: In fact, as we have noted, they transform under 
the adjoint representation. Thus, they must couple to themselves. 

Pure Maxwell theory is free and so essentially trivial. It contains a noninter- 
acting photon. In contrast, pure Yang-Mills theory contains self-interaction and is 
highly nontrivial. Note that the structure coefficients f?°° are completely fixed by 
group theory, and thus in contrast to a scalar field theory, the cubic and quartic 
self-interactions of the gauge bosons, including their relative strengths, are totally 


IV.5, Nonabelian Gauge Theory 231 


Wavavavavivausvavavala 
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Figure IV.5.1 


fixed by symmetry. If any 4-dimensional field theory can be solved exactly, pure 
Yang-Mills theory may be it, but in spite of the enormous amount of theoretical 
work devoted to it, it remains unsolved (see Chapters VII.3 and VII.4). 


’t Hooft’s double-line formalism 


While it is convenient to use the component fields At for many purposes, the matrix 
field A, = Aure embodies the mathematical structure of nonabelian gauge theory 
more elegantly. The propagator for the components of the matrix field in a U(N) 
gauge theory has the form 


(0| TA,(x),A,(0)F |0) 
= (0| TAS (x) APO) 10)(77),( °F (19) 
a Beery (Ty a 58 


The matrix structure Al j naturally suggests that we, following ’t Hooft, introduce 
a double-line formalism, in which the gauge potential is described by two lines, 
each associated with one of the two indices i and j. We choose the convention 
that the upper index flows into the diagram, while the lower index flows out of the 
diagram. The propagator in (19) is represented in Figure IV.5.2a. The double-line 
formalism allows us to reproduce the index structure 5,54 naturally. The cubic and 
quartic couplings are represented in Figure IV.5.2b and c. 

‘Tre constant g introduced in (18) is known as the Yang-Mills coupling constant. 
We can always write the quadratic term in (18) in the convention commonly used 
in electromagnetism by a trivial rescaling A —> gA. After this rescaling, the cubic 
and quartic couplings of the Yang-Mills boson go as g and g’, respectively. The 


covariant derivative in (1) becomes D,,p = 0,9 — igA,,@, showing that g also 
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A SK 


{b) {c) 


Figure IV.5.2 


measures the coupling of the Yang-Mills boson to matter. The convention we 
used, however, brings out the mathematical structure more clearly. As written 
in (18), g” measures the ease with which the Yang-Mills boson can propagate. 
Recall that in Chapter III.7 we also found this way of defining the coupling as a 
measure of propagation useful in electromagnetism. We will see in Chapter VIII.1 
that Newton’s coupling appears in the same way in the Einstein-Hilbert action for 


gravity. 


The 6 term 


Besides tr F,,,F”", we can also form the dimension-4 term e/” tr Fy, Fyp. 
Clearly, this term violates time reversal invariance T and parity P since it involves 
one time index and three space indices. We will see later that the strong interaction 
is described by a nonabelian gauge theory, with the Lagrangian containing the so- 
called 6 term (9/3277)e""” tr F uv yp: AS you will show in Exercise IV.5.3 this 
term is a total divergence and does not contribute to the equation of motion. Nev- 
ertheless, it induces an electric dipole moment for the neutron. The experimental 
upper bound on the electric dipole moment for the neutron translates into an upper 
bound on @ of the order 10~?. I will not go into how particle physicists resolve the 
problem of making sure that @ is small enough or vanishes outright. 


Coupling to matter fields 


We took the scalar field g to transform in the fundamental representation of the 
group. In general, g can transform in an arbitrary representation R of the gauge 
group G. We merely have to write the covariant derivative more generally as 


Dy, = (8, —iASTS))g (20) 
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where TiRy represents the ath generator in the representation R (see Exercise 
IV.5.1). 

Clearly, the prescription to turn a globally symmetric theory into a locally 
symmetric theory is to replace the ordinary derivative @,, acting on any field, 
boson or fermion, belonging to the representation ® by the covariant derivative 
D,=(@,-1 AUT Ry): Thus, the coupling of the nonabelian gauge potential to a 
fermion field is given by 


L=liy"D, —m)b = vliy"d, + yY AL Tay — my. (21) 


Fields listen to the Yang-Mills gauge bosons according to the representation R 
that they belong to, and those that belong to the trivial identity representation do 
not hear the call of the gauge bosons. In the special case of a U(1) gauge theory, 
also known as electromagnetism, R corresponds to the electric charge of the field. 
Those fields that transform trivially under U (1) are electrically neutral. 


Appendix 


Let me show you another context, somewhat surprising at first sight, in which the Yang-Mills 
structure pops up.” Consider Schrédinger’s equation 


i =v) = H(tyw(r) (22) 


with a time dependent Hamiltonian H (t). The setup is completely general: For instance, we 
could be talking about spin states in a magnetic field or about a single particle nonrelativistic 
Hamiltonian with the wave function W (x, t). We suppress the dependence of H and ¥ on 
variables other than time f. 

First, solve the eigenvalue problem of H(t). Suppose that because of symmetry or some 
other reason the spectrum of H(t) contains an n-fold degeneracy, in other words, there 
exist n distinct solutions of the equations H(t)y,(t) = E(t), (t), witha = 1, .--,n. Note 
that E(t) can vary with time and that we are assuming that the degeneracy persists with 
time, that is, the degeneracy does not occur “accidentally” at one instant in time. We can 
always replace H(t) by H(t) — E(t) so that henceforth we have H(t)y,(t) = 0. Also, the 
states can be chosen to be orthogonal so that (y,(1)|¥,(t}) = 5,9. (For notational reasons 
it is convenient to jump back and forth between the Schrodinger and the Dirac notation. To 
make it absolutely clear, we have 


WOO = f deve. ‘)ta(X,t) 


if we are talking about single particle quantum mechanics.) 


2 Wilczek and A. Zee, “Appearance of Gauge Structure in Simple Dynamical Systems,” 
Phys. Rev. Lett. 52:2111, 1984. 
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Let us now study (22) in the adiabatic limit, that is, we assume that the time scale over 
which H (ft) varies is much longer than 1/ AF, where AF denotes the energy gap separating 
the states y,(t) from neighboring states. In that case, if W(t) starts out in the subspace 
spanned by {1/,(t)} it will stay in that subspace and we can write Y(t) = D>, cg) vy (t). 
Plugging this into (22) we obtain immediately }°[(deq/dt) Yra(t) + cg(t) (OW y/dt)] = 0. 
Taking the scalar product with y,(t), we obtain 


dcp 
—_— =— Apc 23 
a » hala (23) 


with the n by # matrix 


ov, 
or 


Asa(t) =i, (O|-——) (24) 
Now suppose somebody else decides to use a different basis, y/(t) = U*_.(t)y,(t), related 
to ours by a unitary transformation. (The complex conjugate on the unitary matrix U is 
just a notational choice so that our final equation will come out looking the same as a 
celebrated equation in the text; see below.) I have also passed to the repeated indices 
summed notation. Differentiate to obtain (dy) /dt) = UF (t)(0p,/dt) + (dU* /dt)y_(t). 
Contracting this with y,*(t) = U,g(t)¥7(t) and multiplying by i, we find 


' 
A’ =UAU't $US (25) 


Suppose the Hamiltonian H(t) depends on d parameters A!, --- , A¢. We vary the parame- 
ters, thus tracing out a path defined by {A“(r), u = 1, - --, d} in the d-dimensional parameter 
space. For example, for a spin Hamiltonian, {A“} could represent an external magnetic field. 
Now (23) becomes 


dc d)\¥ 
7 =— AW nata— (26) 
a 
if we define (Ay)pq = 1 (W419, Wa), where 0,, = 0/00", and (25) generalizes to 
Ai, =UA,U' +iU4,U" (27) 


We have recovered (IV.5.2). Lo and behold, a Yang-Mills gauge potential A,, has popped 
up in front of our very eyes! 

The “transport” equation (26) can be formally solved by writing c(A) = Pe. f Aya” 
where the line integral is over a path connecting an initial point in the parameter space 
to some final point A and P denotes a path ordering operation. We break the path into 
infinitesimal segments and multiply together the noncommuting contribution e~4«4*" from 
each segment, ordered along the path. In particular, if the path is a closed curve, by the 
time we return to the initial values of the parameters, the wave function will have acquired 
a matrix phase factor, known as the nonabelian Berry’s phase. This discussion is clearly 
intimately related to the discussion of the Aharonov-Bohm phase in the preceding chapter. 
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To see this nonabelian phase, all we have to do is to find some quantum system with 
degeneracy in its spectrum and vary some external parameter such as a magnetic field In 
their paper, Yang and Mills spoke of the degeneracy of the proton and neutron under isospin 
in an idealized world and imagined transporting a proton from one point in the universe to 
another. That a proton at one point can be interpreted as a neutron at another necessitates the 
introduction of a nonabelian gauge potential. I find it amusing that this imagined transport 
can now be realized analogously in the laboratory. 

You will realize that the discussion here parallels the discussion in the text leading up 
to (IV.5.2). The spacetime dependent symmetry transformation corresponds to a parameter 
dependent change of basis. When I discuss gravity in Chapter VIIIL.1 it will become clear 
that moving the basis {ya} around in the parameter space is the precise analog of parallel 
transporting a local coordinate frame in differential geometry and general relativity. We will 
also encounter the quantity Pe™ f Ayane again in Chapter VII.1 in the guise of a Wilson 
loop. 


Exercises 


IV.5.1. Write down the Lagrangian of an SU (2) gauge theory with a scalar field in the 
7 = 2 representation. 


IV.5.2. Prove the Bianchi identity DF =dF+[A, F]=0. Write this out explicitly with 
indices and show that in the abelian case it reduces to half of Maxwell’s equations. 


IV.5.3. In 4-dimensions e“”’? tr F,,,, Fi) can be written as tr F? . Show that d tr F* = 0 
in any dimensions. 


IV.5.4. Invoking the Poincaré lemma (IV.4.5) and the result of Exercise I'V.5.3 show that 
tr F* =d tr(AdA + 2%). Write this last equation out explicitly with indices. 
Identify these quantities in the case of electromagnetism. 


IV.5.5. For a challenge show that t F”, which appears in higher dimensional theories 
such as string theory, are all total divergences. In other words, there exists a 
(2n — 1)-form w»,,_ (A) such that tr F” = da, (A). [Hint: A compact rep- 
resentation of the form @»,_,(A) = h dt fo,_\(t, A) exists.] Work out ws5(A) 
explicitly and try to generalize knowing w and w;. Determine the (2n — 1)-form 
fon—1(, A). For help, see B. Zumino et al., Nucl. Phys. B239:477, 1984. 


IV.5.6. Write down the Lagrangian of an SU (3) gauge theory with a fermion field in the 
fundamental or defining triplet representation. 


3A. Zee, “On the Non-Abelian Gauge Structure in Nuclear Quadrupole Resonance,” 
Phys. Rev. A38:1, 1988. The proposed experiment was later done by A. Pines. 


Chapter I'V.6 


The Anderson-Higgs Mechanism 


The gauge potential eats the Nambu-Goldstone boson 


As I noted earlier the ability to ask good questions is of crucial importance in 
physics. Here is an excellent question: How does spontaneous symmetry breaking 
manifest itself in gauge theories? 

Going back to Chapter IV.1, we gauge the U(1) theory in (IV.1.6) by replacing 
dy with Dy = (0, — ieA,)g so that 


£=-1F,,F*’ + (Dg)'De + po'g — aly) (1) 


Now when we go to polar coordinates g = pe’? we have Die = (8,0 + ip (8,8 — 
eA,,)le’? and thus 


L= —4F yy FY + p° (8,8 — eA,)° + (ao) 4 up" _ rp" Q2) 


(Compare this with & = p (3,0) + (Ap)? + up? — Ap" in the absence of the 
gauge field.) Under a gauge transformation g — e'“g (so that 6 — @ + q@) and 
eA, > eA, + 0,a@, and thus the combination B, = A, ~ (1/e)d,@ is gauge 
invariant. The first two terms in £ thus become —} FF" + e*p" Be. Note that 
Fy =0,A, — dA, =9,B, — 3,B, has the same form in terms of the potential 

“Upon spontaneous symmetry breaking, we write o = ( 1/V2)(v + x), with 
v= p2/4. Hence 


1 1 1 
_t uv , +ag2p2 , 2 2, 42.2p2 
L= ree + 5M Bs, +e UXB, + Fe KB, 


$+ 4 (ay)? — py? — Vi — 2x4 4 ue (3) 
2 4 4d 


The theory now consists of a vector field B,, with mass 
M=ev (4) 


interacting with a scalar field x with mass ~/2y. The phase field 6, which would 
have been the Nambu-Goldstone boson in the ungauged theory, has disappeared. 
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We say that the gauge field A,, has eaten the Nambu-Goldstone boson; it has gained 
weight and changed its name to B,,. 

Recall that a massless gauge field has only 2 degrees of freedom, while a massive 
gauge field has 3 degrees of freedom. A massless gauge field has to eat a Nambu- 
Goldstone boson in order to have the requisite number of degrees of freedom. 
The Nambu-Goldstone boson becomes the longitudinal degree of freedom of the 
massive gauge field. We do not lose any degrees of freedom, as we had better not. 

This phenomenon of a massless gauge field becoming massive by eating a 
Nambu-Goldstone boson was discovered by numerous particle physicists! and is 
known as the Higgs mechanism. People variously call g, or more restrictively x, 
the Higgs field. The same phenomenon was discovered in the context of condensed 
matter physics by Landau, Ginzburg, and Anderson, and is known as the Anderson 
mechanism. 

Letus give a slightly more involved example, an O(3) gauge theory with a Higgs 
field gy? (a = 1, 2, 3) transforming in the vector representation. The Lagrangian 
contains the kinetic energy term 1(D,,%)”, with D,, 97 = 0,97 + ge“ A? y° as 
indicated in (IV.5.21). Upon spontaneous symmetry breaking, @ acquires a vacuum 
expectation value which without loss of generality we can choose to point in the 
3-direction, so that (y*) = v87?. We set y? = v and see that 


3(D,y7)? > 3(gv)(Al AM! + AZ AM) (5) 


The gauge potential Al, and At acquires mass gu [compare with (4)] while A}, 
remains massless. 

A more elaborate example is that of an SU (5) gauge theory with ¢ transforming 
as the 24-dimensional adjoint representation. (See Appendix C for the necessary 
group theory.) The field g is a 5 by 5 hermitean traceless matrix. Since the 
adjoint representation transforms as p > gy + i0°[T®, g], we have Dy = d, 9 — 
igAt IT’, gy] with a = 1, .-- , 24 running over the 24 generators of SU(5). By a 
symmetry transformation the vacuum expectation value of g can be taken to be 
diagonal (g/.) = v 5 G, j=1,---,5), with 2, vj =0. (This is the analog of our 
choosing (@g) to point in the 3-direction in the preceding example.) We have in the 
Lagrangian 


tr(D,y)(D"y) > g? trT?, {g) ty), T’1A4 AM” (6) 


The gauge boson masses squared are given by the eigenvalues of the 24 by 24 
matrix g? tr[T%, (v)][(v), T°], which we can compute laboriously for any given 
(¢). 

It is easy to see, however, which gauge bosons remain massless. As a specific 
example (which will be of interest to us in Chapter VII.6), suppose 


‘Including P. Higgs, F. Englert, R. Brout, G. Guralnik, C. Hagen, and T. Kibble. 
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200 0 0 
020 0 0 

@=vl002 0 0 (7) 
000 -3 0 
000 0 -3 


‘ AO 
Which generators T* commute with (y)? Clearly, generators of the form ( 0 $) 


and of the form 4 2 ) do. Here A represents 3 by 3 hermitean traceless matrices 


(of which there are 3* — 1 = 8, the so-called Gell-Mann matrices) and B represents 
2 by 2 hermitean traceless matrices (of which there are 2” — 1 = 3, namely the Pauli 
matrices). Furthermore, the generator 


200 0 O 

020 0 O 

002 0 0 (8) 
00 0 -3 O 

000 0 -3 


being proportional to {g), obviously commutes with (¢). Clearly, these generators 
generate SU(3), SU(2), and U(1), respectively. Thus, in the 24 by 24 mass- 
squared matrix g? tr[T%, (y) ]{(~), T?] there are blocks of submatrices that vanish, 
namely, an 8 by 8 block, a3 by 3 block, and a 1 by 1 block. We have 8+ 3+ 1= 
12 massless gauge bosons. The remaining 24 — 12 = 12 gauge bosons acquire 
mass. 


Counting massless gauge bosons 


In general, consider a theory with the global symmetry group G spontaneously 
broken to a subgroup H. As we learned in Chapter IV.1, n(G) — n( 1) Nambu- 
Goldstone bosons appear. Now suppose the symmetry group G is gauged. We 
start with n(G) massless gauge bosons, one for each generator. Upon spontaneous 
symmetry breaking, the n(G) — n(H) Nambu-Goldstone bosons are eaten by 
n(G) — n(A) gauge bosons, leaving n(H) massless gauge bosons, exactly the 
right number since the gauge bosons associated with the surviving gauge group H 
should remain massless. 

In our simple example, G = U(1), H = nothing: n(G) = 1 and n(H) =0.In 
our second example, G = O(3), H = O(2) ~ U(1) :n(G) = 3 andn(A) = 1, and 
so we end up with two massless gauge bosons. In the third example, G = SU (5), 
H = SU (3) @ SU(2) @ U(1) so thatn(G) = 24 and n(H) = 12. Further examples 
and generalizations are worked out in the exercises. 


Gauge boson mass spectrum 


It is easy enough to work out the mass spectrum explicitly. The covariant derivative 
of a Higgs field is D,g = 9,9 + gAl Tp, where g is the gauge coupling, T¢ 
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are the generators of the group G when acting on gy, and AY the gauge potential 
corresponding to the ath generator. Upon spontaneous symmetry breaking we 
replace g by its vacuum expectation value (y) = v. Hence D,,g is replaced by 
gaily. The kinetic term j(D"g - D,,¢) {here (-) denotes the scalar product in 
the group G] in the Lagrangian thus becomes 


g°(T4v . T?uyAM AY, = 5 AMe (m7)? A’ 
where we have introduced the mass-squared matrix 
(uw)? = g°(T*v- Tv) (9) 


for the gauge bosons. [You will recognize (9) as the generalization of (4); also 
compare (5) and (6).] We diagonalize (47)“° to obtain the masses of the gauge 
bosons. The eigenvectors tell us which linear combinations of Af, correspond to 
mass eigenstates. 

Note that Tg is an n(G) by n(G) matrix with n(H) zero eigenvalues, whose 
existence can also be seen explicitly. Let T° be a generator of H . The statement that 
H remains unbroken by the vacuum expectation value v means that the symmetry 
transformation generated by T° leaves vu invariant; in other words, T°v = 0, and 
hence the gauge boson associated with T° remains massless, as it should. All these 
points are particularly evident in the SU (5) example we worked out. 


Feynman rules in spontaneously broken gauge theories 


It is easy enough to derive the Feynman rules for spontaneously broken gauge 
theories. Take, for example, (3). As usual, we look at the terms quadratic in the 
fields, Fourier transform, and invert. We see that the gauge boson propagator is 
given by 


i kiky 
Mie SH ” 
and the x propagator by 
H 
k?2 — 2p? + ie ay) 


I leave it to you to work out the rules for the interaction vertices. 

As I said in another context, field theories often exist in several equivalent 
forms. Take the U (1) theory in (1) and instead of polar coordinates go to Cartesian 
coordinates g = (1//2)(y, + i¢2) so that 


1 


Dig = 9,9 —ieAyg = B 


[(9,,9) + €A,,G2) +1(0,02 — eA, ¢1)] 
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Then (1) becomes 
L=—ZFyyF™ + 1Gy~1 + eAyor)” + (Oyo. — eA] (12) 
+ 5u°(y? + 9) — tay + 93)? 


Spontaneous symmetry breaking means setting g > v + gj with v = /?/A. 
The physical content of (12) and (3) should be the same. Indeed, expand the 
Lagrangian (12) to quadratic order in the fields: 


4 
£ = i — GF yyFh + 7MPA, — MA," Gy + G91) — 20°97] 
+ 5 (pp) + °°: (13) 


The spectrum, a gauge boson A with mass M = ev and a scalar boson re with 
mass +/21, is identical to the spectrum in (3). (The particles there were named B 
and x.) 

But oops, you may have noticed something strange: the term —M A,,0"@) which 
mixes the fields A,, and g). Besides, why is g> still hanging around? Isn’t he 
supposed to have been eaten? What to do? 

We can of course diagonalize but it is more convenient to get rid of this mixing 
term. Referring to the Fadeev-Popov quantization of gauge theories discussed in 
Chapter III.4 we note that the gauge fixing term generates a term to be added to 
£. We can cancel the undesirable mixing term by choosing the gauge function 
to be f(A) = 0A + Eevgy, — o. Going through the steps, we obtain the effective 
Lagrangian Leg = & — (1/2&)(@A + &Mg>)* [compare with (II1.4.7)]. The un- 
desirable cross term —MA,,0", in £ is now canceled upon integration by parts. 
In Leg the terms quadratic in A now read —} F,,F#" + 4M? A — (1/2&)(aA)? 
while the terms quadratic in g, read 11(3,p2)* — —M’,”], immediately giving us 
the gauge boson propagator 


=! | 1— gy) 14 
k2— M2 +ie Suv — ¢ 5) EM aie u%) 


and the g) propagator 


i 

k2? —EM2 +ie (15) 

This one-parameter class of gauge choices is known as the R; gauge. Note that 
the would-be Goldstone field gz remains in the Lagrangian, but the very fact that 
its mass depends on the gauge parameter & brands it as unphysical. In any physical 
process, the € dependence in the g) and A propagators must cancel out so as to 
leave physical amplitudes € independent. In Exercise IV.6.9 you will verify that 
this is indeed the case in a simple example. 
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Different gauges have different advantages 


You might wonder why we would bother with the R; gauge. Why not just use 
the equivalent formulation of the theory in (3), known as the unitary gauge, in 
which the gauge boson propagator (10) looks much simpler than (14) and in which 
we don’t have to deal with the unphysical g field? The reason is that the R; 
gauge and the unitary gauge complement each other. In the R; gauge, the gauge 
boson propagator (14) goes as 1/k* for large k and so renormalizability can be 
proved rather easily. On the other hand, in the unitary gauge all fields are physical 
(hence the name “unitary’”’) but the gauge boson propagator (10) apparently goes 
as k,,k, /k? for large k; to prove renormalizability we must show that the k uky piece 
of the propagator does not contribute. Using both gauges, we can easily prove that 
the theory is both renormalizable and unitary. By the way, note that in the limit 
— —> co (14) goes over to (10) and g, disappear, at least formally. 

In practical calculations, there are typically many diagrams to evaluate. In the 
R; gauge, the parameter € darn well better disappears when we add everything 
up to form the physical mass shell amplitude. The RX; gauge is attractive precisely 
because this requirement provides a powerful check on practical calculations. 

I remarked earlier that strictly speaking, gauge invariance is not so much a 
symmetry as the reflection of a redundancy in the degrees of freedom used. (The 
photon has only 2 degrees of freedom but we use a field A,, with 4 components.) A 
purist would insist, in the same vein, that there is no such thing as spontaneously 
breaking a gauge symmetry. To understand this remark, note that spontaneous 
breaking amounts to setting o = |g| to v and @ to 0 in (2). The statement |g| = v 
is perfectly U (1) invariant: It defines a circle in g space. By picking out the point 
@ =0 on the circle in a globally symmetric theory we break the symmetry. In 
contrast, in a gauge theory, we can use the gauge freedom to fix 6 = 0 everywhere 
in spacetime. Hence the purists. I will refrain from such hair-splitting in this book 
and continue to use the convenient language of symmetry breaking even in a gauge 
theory. 


Exercises 


IV.6.1. Consider an SU(5) gauge theory with a Higgs field y transforming as the 5- 
dimensional representation: g’, i = 1,2, - +. , 5. Show that a vacuum expectation 
value of g breaks SU(5) to SU (4). Now add another Higgs field gy’, also trans- 
forming as the 5-dimensional representation. Show that the symmetry can either 
remain at SU (4) or be broken to SU (3). 


IV.6.2. In general, there may be several Higgs fields belonging to various representations 
labeled by a. Show that the mass squared matrix for the gauge bosons generalize 
immediately to (u*)® = 7, g7(T4v, - T°v,), where vy is the vacuum expec- 
tation value of y, and T° is the ath generator represented on y, . Combine the 
situations described in Exercises IV.6.1 and I'V.6.2 and work out the mass spec- 
trum of the gatige bosons. 
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IV.6.3. 


IV.6.4. 


IV.6,5. 


IV.6.6. 


IV.6.7. 


IV.6.8. 


IV.6.9. 


IV. Symmetry and Symmetry Breaking 


The gauge group G does not have to be simple; it could be of the form G, ® G2.® 
-+-@ G,, with coupling constants 9), 92, ---, 9,. Consider, for example, the case 
G = SU (2) @ U (1) anda Higgs field g transforming like the doublet under SU (2) 
and like a field with charge 5 under U(1), so that D,,9 = 4,9 — i[gAt (07/2) + 
eB, 4}g. Let (9) = (° ) _ Determine which linear combinations of the gange 
bosons Av, and B,, acquire mass. 


In Chapter IV.5 you worked out an SU (2) gauge theory with a scalar field ¢ in 
the J = 2 representation. Write down the most general quartic potential V (g) and 
study the possible symmetry breaking pattern. 


Complete the derivation of the Feynman rules for the theory in (3) and compute 
the amplitude for the physical process x + x > B+ B. 


Derive (14). [Hint: The procedure is exactly the same as that used to obtain 
(111.4.13).] Write£ = 4A, Q*"A, with Q#” = (7 + M*)g#” — [1— (1/é)]0#a” 
or in momentum space Q#” = —(k* — M*)g#” + [1— (1/£)]k#k”. The propa- 
gator is the inverse of Q#”. 


Work out the (---) in (13) and the Feynman rules for the various interaction 
vertices. 


Using the Feynman rules derived in Exercise IV.6.8 calculate the amplitude for 
the physical process gy, + 9g, -> A+ A and show that the dependence on € cancels 
out. Compare with the result in Exercise IV.6.6. [Hint: There are two diagrams, 
one with A exchange and the other with g, exchange. ] 


Consider the theory defined in (12) with 4. = 0. Using the result of Exercise IV.3.6 
show that 
1, 4 1 2 4,4 ( ¢ 25 
4 = —Ag’ + —— (100° + 3e log —-—)+-::- (16 
eft (P) 4? Gan? )p m6 (16) 
where g? = ye + ¢; . This potential has a minimum away from g = 0 and thus 
the gauge symmetry is spontaneously broken by quantum fluctuations. In Chapter 
IV.3 we did not have the e* term and argued that the minimum we got there was 
not to be trusted. But here we can balance the Ag* against e*g* log(y7/M7) for A 
of the same order of magnitude as e*. The minimum can be trusted. Show that the 
spectrum of this theory consists of a massive scalar boson and a massive vector 
boson, with 
m? (scalar) _ 3 e 


= 17 
m2(vector) 27 47 (17) 


For help, see S. Coleman and E. Weinberg, Phys. Rev. D7: 1883, 1973. 


Chapter IV.7 
Chiral Anomaly 


Classical versus quantum symmetry 


I have emphasized the importance of asking good questions. Here is another 
good one: Is a symmetry of classical physics necessarily a symmetry of quantum 
physics? 

We have a symmetry of classical physics if a transformation g > g + dg 
leaves the action $(g) invariant. We have a symmetry of quantum physics if the 
transformation leaves the path integral f Dge!*©) invariant. 

When our question is phrased in this path integral language, the answer seems 
obvious: Not necessarily. Indeed, the measure Dg may or may not be invariant. 

Yet historically, field theorists took as almost self-evident the notion that any 
symmetry of classical physics is necessarily a symmetry of quantum physics, and 
indeed, almost all the symmetries they encountered in the early days of field theory 
had the property of being symmetries of both classical and quantum physics. 
For instance, we certainly expect quantum mechanics to be rotational invariant. 
It would be very odd indeed if quantum fluctuations were to favor a particular 
direction. 

You have to appreciate the frame of mind that field theorists operated in to 
understand their shock when they discovered in the late 1960s that quantum 
fluctuations can indeed break classical symmetries. Indeed, they were so shocked 
as to give this phenomenon the rather misleading name “anomaly,” as if it were 
some kind of sickness of field theory. With the benefits of hindsight, we now 
understand the anomaly as being no less conceptually innocuous as the elementary 
fact that when we change integration variables in an integral we better not forget 
the Jacobian. 

With the passing of time, field theorists have developed many different ways of 
looking at the all important subject of anomaly. They are all instructive and shed 
different lights on how the anomaly comes about. For this introductory text I choose 
to show the existence of anomaly by an explicit Feynman diagram calculation. The 
diagram method is certainly more laborious and less slick than other methods, but 
the advantage is that you will see a classical symmetry vanishing in front of your 
very eyes! No smooth formal argument for us. 
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The lesser of two evils 


Consider the theory of a single massless fermion £ = wi y"d,w. You can hardly 
ask for a simpler theory! Recall from Chapter II.1 that £ is manifestly invariant 
under the transformations y —> ey and y —> e'°Y’ y, corresponding to the con- 
served vector current J! = yyw and the conserved axial current Je = py yw 
respectively. You should verify that 0,,/* = 0 and a, Ji = 0 follow immediately 
from the classical equation of motion iy"d, yp = 0. 

Let us now calculate the amplitude for a spacetime history in which a fermion- 
anti-fermion pair is created at x, and another such pair is created at x2 by the 
vector current, with the fermion from one pair annihilating the antifermion from 
the other pair and the remaining fermion-antifermion pair being subsequently 
annihilated by the axial current. This is a long-winded way of describing the 
amplitude (0| T J2(0)J#(x,)J” (x2) |0) in words, but I want to make sure that you 
know what I am talking about. Feynman tells us that the Fourier transform of this 
amplitude is given by the two “triangle” diagrams in Figure IV.7.1a and b. 


da‘ 
AM” (ky, ky) = -pP f 2 


tr (yrs—t_e_ bb Pt _pw—t_ yt) a 
(ry p-d p-h BY pa" ph” Bp) 
with gq =k, + ky. Note that the two terms are required by Bose statistics. The 
overall factor of (—1) comes from the closed fermion loop. 

Classically, we have two symmetries implying 3, J“ = 0 and a, J£ = 0. In the 
quantum theory, if 0,7 * = 0 continues to hold, then we should have k, Arey = 
and ky, A’#” = 0, and if 3, Jé° = 0 continues to hold, then q, A*“” = 0. Now that 
we have things all set up, we merely have to calculate A+” to see if the two 
symmetries hold up under quantum fluctuations. No big deal. 

Before we blindly calculate, however, let us ask ourselves how sad we would 
be if either of the two currents J’ and Je fails to be conserved. Well, we would 
be very upset if the vector current is not conserved. The corresponding charge 


wv tad 


Figure IV.7.1 
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Q = f d*xJ° counts the number of fermions. We wouldn’t want our fermions 
to disappear into thin air or pop out of nowhere. Furthermore, it may please us 
to couple the photon to the fermion field yr. In that case, you would recall from 
Chapter II.7 that we need 0, J" = 0 to prove gauge invariance and its consequence 
that the photon has only two degrees of polarization. More explicitly, imagine 
a photon line coming into the vertex labeled by yz in Figure IV.7.1a and b with 
propagator (i /k?)[E (kip kt) /k?) — Sup). The gauge dependent term &(ky,,k; of K?) 
would not go away if the vector current is not conserved, that is, if k, Aree fails 
to vanish. 

On the other hand, quite frankly, just between us friends, we won’t get too upset 
if quantum fluctuation violates axial current conservation. Who cares if the axial 
charge Q° = f dx Ig is not constant in time? 


Shifting integration variable 


So, do k; pare and kp, A*#" vanish? We will look over Professor Confusio’s 
shoulders as he calculates k;,, A*#¥ (We are now in the 1960s, long after the 
development of renormalization theory as described in Chapter ITI.1 and Confusio 
has managed to get a tenure track assistant professorship.) He hits A’“” as written 
in (1) with k;,, and using what he learned in Chapter II.7 writes /; in the first term 
as p — (p — &#,) and inthe second term as ( p — #,) —( p— ¢), thus obtaining 


ky A" (ky, ky) 


1 a a3 , 

“if oy "ory > von a P ” 
Just as in Chapter II.7, Confusio recognizes that in the integrand the first term is 
just the second term with the shift of the integration variable p > p — k,. The two 
terms cancel and Professor Confusio publishes a paper saying k, pare = 0, as we 
all expect. 

Remember back in Chapter II.7 I said we were going to worry later about 
whether it is legitimate to shift integration variables. Now is the time to worry! 

You could have asked your calculus teacher long ago when it is legitimate to 
shift integration variables. When is fre dpf (p +a) equal to ["— +90 dpf(p)? The 
difference between these two integrals i is 


+o00 d 
[ av@F pp) ++) = al fe+00) — F(-o0y ++ 
oo dp 
Clearly, if f(+00) and f(—oo) are two different constants, then it is not okay 
to shift. But if the integral fre dpf (p) is convergent, or even logarithmically 
divergent, it is certainly okay. It was okay in Chapter II.7 but definitely not here 
in (2)! 

As usual, we rotate the Feynman integrand to Euclidean space. Generalizing 
our observation above to d-dimensional Euclidean space, we have 
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| dpire +a) — fon= f afptara, £(p) ++] 


which by Gauss’s theorem is equal to 


Poo 


P 
lim a? (+) f(P)Sg_1(P) 


where S,_1(P) is the area of a (d — 1)-dimensional sphere (see Appendix E) and 
where an average over the surface of the sphere is understood. (Recall from our 
experience evaluating Feynman diagrams that the average of P# P”/ P? is equal 
to 1n”” by a symmetry argument, with the normalization } fixed by contracting 
with 77,,,.) Rotating back, we have for a 4-dimensional Minkowskian integral 


d _ _ a ( Pu 2 p3 
i a“ pLf(p +4) — f(p)|= im ia (3 x «) f(P\(2n2P) 3) 


Note the i from Wick rotating back. 


Applying (3) with 
1 L\_ why? (pb — by” By") _ die kor Do 
f(py=tr Ca yt)= p- Dy By ar P 
P-— kp (p — k3)*p (p —ko)*p 
we obtain 
P,, diet OAK 
kyAre? = i(— ky ad ig’ ar Py 7? P? _ Sa ae oe 


(27)4 Pte p4 

Contrary to what Confusio said, k,,,A*"” # 0. 
As I have already said, this would be a disaster. Fermion number is not con- 

served and matter would be disintegrating ali around us! What is the way out? 

In fact, we are only marginally smarter than Professor Confusio. We did not 
notice that the integral defining A*“” in (1) is linearly divergent and is thus not 
well defined. 

Oops, even before we worry about calculating k,,, Aru and k>, A*#” we better 
worry about whether or not A*“” depends on the physicist doing the calculation. 
In other words, suppose another physicist chooses! to shift the integration variable 
p in the linearly divergent integral in (1) by an arbitrary 4-vector a and define 


AY (a, ki, k>) 


d‘ 1 1 
= —] pf P tr wy PL 
COT Om’ Ged d" ped- hy pea 
+ {u, kyo v, ko} (4) 


' This is the freedom of choice in labeling internal momenta mentioned in Chapter I.7. 
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There can be as many results for the Feynman diagrams in Figure IV.7.1a and b 
as there are physicists! That would be the end of physics, or at least quantum field 
theory, for sure. 

Well, whose result should we declare to be correct? 

The only sensible answer is that we trust the person who chooses an a such that 
ki A*"”(a, ky, ka) and ky, A” (a, kj, ky) vanish, so that the photon will have the 
right number of degrees of freedom should we introduce a photon into the theory. 

Let us compute ar(a, ky, ky) — A)” (k,, ky) by applying (3) to f(p) = 
tr(y*y> - = hae EF, 7 sy 5): Noting that 


tr(y*y> Py” Py" P) 
Pé 


_ 2PHtr(y*y? Py” P)— PP t(y*y? Py’y") _ —4iP? Per 
pé pé 


FP) i 


we see that 


4i P,P 
A” (a, ky, ky) — AM (ky, ky) = ae erent + {, kyo v, ky} 


8x2 Pw 


=— 223 Tg, + {ky v, ky} (5) 
There are two independent momenta k, and k, in the problem, so we can take 
a=a(k; +k.) + B(k, — k). Plugging into (5), we obtain 


A (a, ky, ky) = A” (ky, ky) + een (ky — kao (6) 
ri 


Note that a drops out. 

As expected, APY (a, k1, ky) depends on 8, and hence on a. Our unshakable 
desire to have a conserved vector current, that is, ky,, A+” (a, ky, ky) =0, now fixes 
the parameter 6 upon recalling 


A ia 
ky ae (ky, ky) = ane’ ? Kirke 


Hence, we must choose to deal with A’“”(a, k,, ky) with B = —}. 

One way of viewing all this is to say that the Feynman rules do not suffice 
in determining (0| T J3(0)J * (x1) J°(x2) |0). They have to be supplemented by 
vector current conservation. The amplitude (0| T Je (0) J# (x1) J’ (x5) |0} is defined 
by A*#"(a, ky, kz) with B = 3. 


Quantum fluctuation violates axial current conservation 


Now we come to the punchline of the story. We insisted that the vector current be 
conserved. Is the axial current conserved? 
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To answer this question, we merely have to compute 
i 
qn 0a, ky, ke) = Gy (ke, ka) + Fe ke kag (7) 


By now, you know how to do this: 


1 1 
(v° y” yt 


4 
AM Ue, k =i | oP 
BAT ID=! I Oy pq pb-h 


5 1 v 1 “) 
—y y —Y" | +{h, ky ov, ky} 
p-— hk, p 


= a BviAa 

=e kiko (8) 
Indeed, you recognize that the integration has already been done in (2). We finally 
obtain 


i 
qb” (a, ky, ky) = 530 kukre (9) 


The axial current is not conserved! 

In summary, in the simple theory £ = Wi y“d,w while the vector and axial 
currents are both conserved classically, quantum fluctuation destroys axial current 
conservation. This phenomenon is known variously as the anomaly, the axial 
anomaly, or the chiral anomaly. 


Consequences of the anomaly 


As I said, the anomaly is an extraordinarily rich subject. I will content myself with 
a series of remarks, the details of which you should work out as exercises. 


1. Suppose we gauge our simple theory L = piy"(3,, — ieA,,) and speak of 
A,, as the photon field. Then in Figure IV.7.1 we can think of two photon 
lines coming out of the vertices labeled yz and v. Our central result (9) can 
then be written elegantly as two operator equations: 


CLASSICAL PHYSICS: 3,,J5, = 0 (10) 


2 
QUANTUM PHYsics: 3, J5° = aa Fay Fig (11) 


The divergence of the axial current 0 A Js is not zero, but is an operator 
capable of producing two photons. 

2. Applying the same type of argument as in Chapter IV.2 we can calculate the 
rate of the decay 7° > y + y. Indeed, historically people used the erroneous 
result (10) to deduce that this experimentally observed decay cannot occur! 
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See Exercise IV.7.2. The resolution of this apparent paradox led to the correct 
result (11). 

. Writing the Lagrangian in terms of left and right handed fields yp and 
yr, and introducing the left and right handed currents Jb = Wry“yp and 
Jt = yw, we can repackage the anomaly as 


9 jt = 1 <a pvio 
and 


1 e 
Bb AG 
wt = 2am FuvF yo (12) 


(Hence the name chiral!) We can think of left handed and right handed 
fermions running around the loop in Figure IV.7.1, contributing oppositely 
to the anomaly. 

. Consider the theory £ = w(i yd, —m)y. Then invariance under the trans- 
formation y —> e!9’°y is spoiled by the mass term. Classically, a, Js = 
2m ypiy>y: The axial current is explicitly not conserved. The anomaly now 


says that quantum fluctuation produces an additional term. In the theory 
L=wliy’(, —ieA,) — my, we have 


a, Jt = Impiy?y + —— el” F.,, Fig (13) 


aan m)? 


. Recall that in Chapter III.7 we introduced Pauli- Villars regulators to calcu- 
late vacuum polarization. We subtract from the integrand what the integrand 
would have been if the electron mass were replaced by some regulator mass. 
The analog of electron mass in (1) is in fact 0 and so we subtract from the in- 
tegrand what the integrand would have been if 0 were replaced by a regulator 
mass M. In other words, we now define 


aM (kas ky) = (DF f SF; d’ (yy Y y” y" ut 
(27)4 bB-G¢ p-k p 
Aa 5 1 y—_—i 1 <1) 
wT p—-q-M” p-h-M 
+ {u, kyo v, kp}. (14) 


Note that as p — oo the integrand now vanishes faster than 1/p>. This is 
in accordance with the philosophy of regularization outlined in Chapters 
Ifl.1 and IJE.7: For p < M, the threshold of ignorance, the integrand is 
unchanged. But for p >> M, the integrand is cut off. Now the integral in 
(14) is superficially logarithmically divergent and we can shift the integration 
variable p at will. 
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ry ry 


(a) (b) 


Figure IV.7.2 


So how does the chiral anomaly arise? By including the regulator mass 
M we have broken axial current conservation explicitly. The anomaly is the 
statement that this breaking persists even when we let M tend to infinity. It 
is extremely instructive (see Exercise IV.7.4) to work this out. 


. Consider the nonabelian theory £ = piy"(3, —i gAtT“)y. We merely 


have to include in the Feynman amplitude a factor of T° at the vertex labeled 
by yz and a factor of T° at the vertex labeled by v. Everything goes through 
as before except that in summing over all the different fermions that run 
around the loop we obtain a factor tr T°T”. Thus, we see instantly that in a 
nonabelian gauge theory 


2 
Ji = ane tt Fy, Fig (15) 


where F',,, = F nye is the matrix field strength defined in Chapter IV.5. 
Nonabelian symmetry tells us something remarkable: The object e447 
tr F,, 4, contains not only a term quadratic in A, but also terms cubic and 
quartic in A, and hence there is also a chiral anomaly with three and four 
gauge bosons coming in, as indicated in Figure IV.7.2a and b. Some people 
refer to the anomaly produced in Figures IV.7.1 and IV.7.2 as the triangle, 
square, and pentagon anomaly. Historically, after the triangle anomaly was 
discovered, there was a controversy as to whether the square and pentagon 
anomaly existed. The nonabelian symmetry argument given here makes 
things totally obvious, but at the time people calculated Feynman diagrams 
explicitly and, as we just saw, there are subtleties lying in wait for the unwary. 


. We will see in Chapter V.7 that the anomaly has deep connections to 


topology. 


. We computed the chiral anomaly in the free theory £ = wi yd, ~m)y. 


Suppose we couple the fermion to a scalar field by adding fey or to the 
electromagnetic field for that matter. Now we have to calculate higher order 
diagrams such as the three-loop diagram in Figure IV.7.3. You would expect 


10. 
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Figure IV.7.3 


that the right-hand side of (9) would be multiplied by 1+ A(/, e, - - -), where 
h is some unknown function of all the couplings in the theory. 

Surprise! Adler and Bardeen proved that h = 0. This apparently mirac- 
ulous fact, known as the nonrenormalization of the anomaly, can be under- 
stood heuristically as follows. Before we integrate over the momenta of the 
scalar propagators in Figure IV.7.3 (labeled by W, and W.) the Feynman 
integrand has seven fermion propagators and thus is more than sufficiently 
convergent that we can shift integration variables with impunity. Thus, be- 
fore we integrate over W, and W> all the appropriate Ward identities are sat- 
isfied, for instance, q,, A ops (k1, ko; W;, W>) = 0. You can easily complete 


the proof. You will give a proof? based on topology in Exercise (V.7.13). 


. The preceding point was of great importance in the history of particle physics 


as it led directly to the notion of color, as we will discuss in Chapter VIL.3. 
The nonrenormalization of the anomaly allowed the decay amplitude for 
n° > y + y to be calculated with confidence in the late 1960s. In the quark 
model of the time, the amplitude is given by an infinite number of Feynman 
diagrams, as indicated in Figure IV.7.4, but the nonrenormalization of the 
anomaly tells us that only Figure IV.7.4a contributes. In other words, the 
amplitude does not depend on the details of the strong interaction. That it 
came out a factor of 3 too small suggested that quarks come in 3 copies, as 
we will see in Chapter VIL3. 

It is natural to speculate as to whether quarks and leptons are composites of 
yet more fundamental fermions known as preons. The nonrenormalization 
of the chiral anomaly provides a powerful tool for this sort of theoretical 
speculation. No matter how complicated the relevant interactions might be, 
as long as they are described by field theory as we know it, the anomaly at 
the preon level must be the same as the anomaly at the quark-lepton level. 


? For a simple proof not involving topology, see J. Collins, Renormalization, page 352. 
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---«@----- 


(a) (b) 


Figure IV.7.4 


This so-called anomaly matching condition? severely constrains the possible 
preon theories. 

11. Historically, field theorists were deeply suspicious of the path integral, 
preferring the canonical approach. When the chiral anomaly was discovered, 
some people even argued that the existence of the anomaly proved that the 
path integral was wrong. Look, these people said, the path integral 


i Di Di e! f dxbiy" @,-iA,)v (16) 


is too stupid to tell us that it is not invariant under the chiral transformation 
y —> e!9Y° y. Fujikawa resolved the controversy by showing that the path 
integral did know about the anomaly: Under the chiral transformation the 
measure Dy Dy changes by a Jacobian. Recall that this was how I motivated 
this chapter: The action may be invariant but not the path integral. 


Exercises 


IV.7.1, Derive (11) from (9). The momentum factors k,, and k>, in (9) become the two 
derivatives in F,,,F,, in (11). 


IV.7.2. Following the reasoning in Chapter IV.2 and using the erroneous (10) show that 
the decay amplitude for the decay 7° > y + y would vanish in the ideal world 
in which the ° is massless. Since the 2° does decay and since our world is close 
to the ideal world, this provided the first indication historically that (10) cannot 
possibly be valid. 


3G, ’t Hooft, in: G. ’t Hooft et al., eds., Recent Developments in Gauge Theories; A. 
Zee, Phys. Lett. 95B:290, 1980. 


IV.7.3. 


IV.7.4. 


IV.7.5. 
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Repeat all the calculations in the text for the theory & = w(iy"“a,, — m)w. 


Take the Pauli-Villars regulated A4"” (k,, k») and contract it with g,. The analog 
of the trick in Chapter II.7 is to write gy° in the second term as [2M + ( p — 
M) —( p-— ¢+M)ly°. Now you can freely shift integration variables. Show 
that 


qi, B*"” (iy, ky) = —2MA"*(ky, ky) (17) 


where 


d‘ 
pv = (_nW3 P 
AM” (ky, ky) = (-Di J aa 


5 1 y 1 ul 
«(y p-d-M’ p-¥,-M ai) Tie hey B) 
Evaluate A“” and show that A”” goes as 1/M in the limit M — oo and so the 
right hand side of (17) goes to a finite limit. The anomaly is what the regulator 
leaves behind as it disappears from the low energy spectrum: It is like the smile 
of the Cheshire cat. [We can actually argue that A" goes as 1/M without doing 
a detailed calculation. By Lorentz invariance and because of the presence of 
y>, S“” must be proportional to e“”*"k,, k» p» but by dimensional analysis, A”” 
must be some constant times eh ky Key, /M. You might ask why we can’t use 
something like 1/(k?) 3 instead of 1/M to make the dimension come out right. The 
answer is that from your experience in evaluating Feynman diagrams in (3 + 1)- 

dimensional spacetime you can never get a factor like 1/ (k?) 2 .] 


There are literally N ways of deriving the anomaly. Here is another. Evaluate 


d* 
Ap — (173 P 
a(R, ka) = (DP J OG 
1 1 1 
tr ( y+y° ——__ py’ _____ ) + tak @ v, ky} 
( B-d—-m p-k—m pm mee 


in the massive fermion case not by brute force but by first using Lorentz invariance 
to write 


AMY ey, ky) = MM kg Ay te beth ka kh Ag 


where A; = A;(k?, k3, q7) are eight functions of the three Lorentz scalars in 
the problem. You are supposed to fill in the dots. By counting powers as in 
Chapters IHI.3 and III.7 show that two of these functions are given by superficially 
logarithmically divergent integrals while the other six are given by perfectly 
convergent integrals. Next, impose Bose statistics and vector current conservation 
ky, 4*"" = 0 = ky, A*” to show that we can avoid calculating the superficially 
logarithmically divergent integrals. Compute the convergent integrals and then 
evaluate g, A*#” (k,, ky). 
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IV.7.7. 


IV.7.8. 
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Discuss the anomaly by studying the amplitude 
(0| TJ3 (0) J (xy) JS (2) |0) 


given in lowest orders by triangle diagrams with axial currents at each ver- 
tex. [Hint: Call the momentum space amplitude A*"”(k,, k).] Show by using 
(vy)? = 1 and Bose symmetry that 


AS” (ki, ko) = 4[A(a, ky, ky) + AMY, ky, —g) + Aa, -9, ky] 


Now use (9) to evaluate g, A3”"(k,, ky). 


Define the fermionic measure Dy in (16) carefully by going to Euclidean space. 
Calculate the Jacobian upon a chiral transformation and derive the anomaly. [Hint: 
For help, see K. Fujikawa, Phys. Rev. Lett. 42: 1195, 1979.} 


Compute the pentagon anomaly by Feynman diagrams in order to check re- 
mark 6 in the text. In other words, determine the coefficient c in 3,, Jf = +--+ 
cehAS tr A AYA, AG. 


PART V 


FIELD THEORY 
AND 
COLLECTIVE PHENOMENA 


I mentioned in the introduction that one of the more intellectually satisfying 
developments in the last two or three decades has been the increasingly important 
role played by field theoretic methods in condensed matter physics. This is a rich 
and diverse subject; in this and subsequent chapters I can barely describe the tip 
of the iceberg and will have to content myself with a few selected topics. 

Historically, field theory was introduced into condensed matter physics in 
a rather direct and straightforward fashion. The nonrelativistic electrons in a 
condensed matter system can be described by a field w, along the lines discussed 
in Chapter IIL.5. Field theoretic Lagrangians may then be written down, Feynman 
diagrams and rules developed, and so on and so forth. This is done in a number of 
specialized texts. What we present here is to a large extent the more modern view 
of an effective field theoretic description of a condensed matter system, valid at 
low energy and momentum. One of the fascinations of condensed matter physics is 
that due to highly nontrivial many body effects the low energy degrees of freedom 
might be totally different from the electrons we started out with. A particularly 
striking example (to be discussed in Chapter VI.2) is the quantum Hall system, 
in which the low energy effective degree of freedom carries fractional charge and 
statistics. 

Another advantage of devoting a considerable portion of a field theory book to 
condensed matter physics is that historically and pedagogically it is much easier to 
understand the renormalization group in condensed matter physics than in particle 
physics. 

I will defiantly not stick to a legalistic separation between condensed matter and 
particle physics. Some of the topics treated in Parts V and VI actually belong to 
particle physics. And of course I cannot be responsible for explaining condensed 
matter physics, any more than I could be responsible for explaining particle physics 
in Chapter IV.2. 


Chapter V.1 
Superfluids 


Repulsive bosons 


Consider a finite density 6 of nonrelativistic bosons interacting with a short ranged 
repulsion. Return to (III.5.6): 


; 1 - 
£ =ig' dy — —a,9'd;9 — 8°(y"g — BY (1) 
2m 
The last term is exactly the Mexican well potential of Chapter IV.1, forcing the 
magnitude of g to be close to ./p , thus suggesting that we use polar variables 


y = /pe'® as we did in (IIL.5.7). Plugging in and dropping the total derivative 
(i/2)8yp, we obtain 


L 


1 1 - 
= — pa — — Fro + o(@0? — (ep — py (2) 
\ 2m (4p 


Spontaneous symmetry breaking 


As in Chapter IV.1 write ./p = ./p + h (the vacuum expectation value of gis./p), 
assume h < ./p, and expand!: 


£ = —2y/pnayo — (0,0)? — —(@,hy? — 4g? ph? + (3) 
2m. 2m 


Picking out the terms up to quadratic in A in (3) we use the “central identity of 
quantum field theory” (see Appendix A) to integrate out h, obtaining 


L=p 


1 p 2 
80 ————-a@ — ?- (9,6)? +--- 
eo 4976 — (1/2m)a? 9 am “i 


~ | oy #aey24--. 
= 7g7 Oo) on (3,0)° + (4) 


‘Note that we have dropped the (potentially interesting) term —p0 because it is a total 
divergence. 


257 


258 V. Field Theory and Collective Phenomena 


In the second equality we assumed that we are looking at processes with wave 
number k small compared to ,/8g2m so that (1/ 2m) a? is negligible compared to 
47. Thus, we see that there exists in this fluid of bosons a gapless mode (often 
referred to as the phonon) with the dispersion 


2 - 
wo = 28'P j2 (5) 
m 
The learned among you will have realized that we have obtained Bogoliubov’s 
classic result without ever doing a Bogoliubov rotation (see the appendix to this 
chapter. 

Let me briefly remind you of Landau’s idealized argument? that a linearly 
dispersing mode (that is, w is linear in k) implies superfluidity. Consider a mass 
M of fluid flowing down a tube with velocity v. It could lose momentum and slow 
down to velocity v’ by creating an excitation of momentum k: Mv = Mv’ + hk. 
This is only possible with sufficient energy to spare if 1 Mv? > 1Mv? + hak). 
Eliminating v’ we obtain for M macroscopic v > w/k. For a linearly dispersing 
mode this gives a critical velocity v, =«@/k below which the fluid cannot lose 
momentum and is hence super. [Thus, from (5) the idealized uv, = g,/20/m.] 

Suitably scaling the distance variable, we can summarize the low energy physics 
of superfluidity in the compact Lagrangian 


1 
L= 7g3 Pu” (6) 


which we recognize as the massless version of the scalar field theory we studied 
in Part I, but with the important proviso that @ is a phase angle field, that is, 9(x) 
and 8(x) + 27 are really the same. This gapless mode is evidently the Nambu- 
Goldstone boson associated with the spontaneous breaking of the global U(1) 
symmetry g — e'%9. 


Linearly dispersing gapless mode 


The physics here becomes particularly clear by thinking about a gas of free bosons. 
We can give a momentum Ak to any given boson at the cost of only AR /2m in 
energy. There exist many low energy excitations in a free boson system. But as 
soon as a short ranged repulsion is turned on between the bosons, a boson moving 
with momentum k would affect all the other bosons. A density wave is set Up as a 
result, with energy proportional to k as we have shown in (5). The gapless mode 
has gone from quadratically dispersing to linearly dispersing. There are far fewer 
low energy excitations. Specifically, recall that the density of states is given by 
N(E) «k?-\(dk/dE). For example, for D = 2 the density of states goes from 


2. D. Landau and E. M. Lifshitz, Statistical Physics, p. 192. 
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N(£) «constant (in the presence of quadratically dispersing modes) to N(E) « E 
(in the presence of linearly dispensing modes) at low energies. 

As was emphasized by Feynman? among others, the physics of superfluidity lies 
not in the presence of gapless excitations, but in the paucity of gapless excitations. 
(After all, the Fermi liquid has a continuum of gapless modes.) There are too few 
modes that the superfluid can lose energy and momentum to. 


Relativistic versus nonrelativistic 


This is a good place to discuss one subtle difference between spontaneous sym- 
metry breaking in relativistic and nonrelativistic theories. Consider the relativistic 
theory studied in Chapter IV.1: £ =(@+)(a@) — (61 — v?)?. It is often conve- 
nient to take the 1. — co limit holding v fixed. In the language used in Chapter IV. 1 
“climbing the wall” costs infinitely more energy than “rolling along the gutter.” The 
resulting theory is defined by 


£L =(80')(9) (7) 


with the constraint ®'@ = y?. This is known as a nonlinear o model, about which 
much more in Chapter VL4. 

The existence of a Nambu-Goldstone boson is particularly easy to see in the 
nonlinear 0 model. The constraint is solved by @ = ve'?, which when plugged 
into £ gives £ = v*(96)*. There it is: the Nambu-Goldstone boson 6. 

Let’s repeat this in the nonrelativistic domain. Take the limit g* > oo with 6 
held fixed so that (1) becomes 


. 1 
L=iglay — a ai? div (8) 


with the constraint g'g = (6. But now if we plug the solution of the constraint g = 
/pe'® into £ (and drop the total derivative —d99), we get & = —(6/2m)(3,0)* 
with the equation of motion a70 = (0). Oops, what is this? It’s not even a propagating 
degree of freedom? Where is the Nambu-Goldstone boson? 

Knowing what I already told you, you are not going to be puzzled by this 
apparent paradox‘ for long, but believe me, I have stumped quite a few excellent 
relativistic minds with this one. The Nambu-Goldstone boson is still there, but as 
we can see from (5) its propagation velocity @/k scales to infinity as g and thus it 
disappears from the spectrum for any nonzero k. 

Why is it that we are allowed to go to this “nonlinear” limit in the relativistic 
case? Because we have Lorentz invariance! The velocity of a linearly dispersing 
mode, if such a mode exists, is guaranteed to be equal to 1. 


3R. P. Feynman, Statistical Mechanics. 
4This apparent paradox was discussed by A. Zee, “From Semionics to Topological 
Fluids” in O. J. P. Ebolic et al., eds., Particle Physics, p. 415. 
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Appendix 


I find that in many texts Bogoliubov’s calculation is obscured by a whole load of formalism. 
Let me take this opportunity to show you how simple it is to demonstrate that there is a 
gapless mode. Use a second quantized formalism with the Hamiltonian 


ke 
H= > aja, + > Glajata, ays pq) 
, 2m pa 


where a; and a, denote creation and annihilation operator for a boson of 3-momentum 
k (with the arrow suppressed in subscripts for notational simplicity), Recognizing that 
a macroscopic defisity of bosons condenses into the k=0 noninteracting ground state 
Bogoliubov replaces aj and a; by c-numbers (ap) = (a}) =u, thus giving to quadratic 
order 


ke 

H~ > 5 8 + > Gu? (aja, + a’ ,a_x + aia’, + a_,a;) (9) 
k k#0 

Clearly, we get a gapless mode since up to an additive constant the second term can be written 

as E40 Gua} +a_,)(a,+ a’ ,) and so the orthogonal combination CH — a_,) creates a 

gapless excitation of momentum k. The crucial point is that (ag) = (a4 ) £0 and not merely 


(aja) # 0: In that case we would only get a gapped mode with energy (2Gu? + k*/2m). 
This shows that breaking the U (1) symmetry of boson number is essential. 


Exercises 


V.1.1. Verify that the approximation used to reach (3) is consistent. 


V.1.2. To confine the superfluid in an external potential W(x) we would add the term 
—W(x)o'(X, t)@(x, t) to (1). Derive the corresponding equation of motion for 
y. The equation, known as the Gross-Pitaevski equation, has been much studied 
in recent years in connection with the Bose-Einstein condensate. 


Chapter V.2 


Euclid, Boltzmann, Hawking, and Field 
Theory at Finite Temperature 


Statistical mechanics and Euclidean field theory 


I mentioned in Chapter I.2 that to define the path integral more rigorously we 
should perform a Wick rotation t = —it;. The scalar field theory, instead of being 
defined by the Minkowskian path integral 


ra =| Deel!) f ax13Ger-VO)] (1) 
is then defined by the Euclidean functional integral 


z= f Dee OM J aesthGe V0 f dye time) (2) 


where d¢x = —id?.x, with ddx =dtgd—v. In (1) (09)? = (89/81)? — (Vo), 
while in (2) (8g)? = (89/8t,)* + (Vy): The notation is a tad confusing but I am 
trying not to introduce too many extraneous symbols. You may or may not find it 
helpful to think of (Vv) + V(q) as one unit, untouched by Wick rotation. I have 
introduced E(y) = f déx[4(ag)? + V(g)], which may naturally be regarded as 
a static energy functional of the field g(x). Thus, given a configuration g(x) in 
d-dimensional space, the more it varies, the less likely it is to contribute to the 
Euclidean functional integral Z. 

The Euclidean functional integral (2) may remind you of statistical mechanics. 
Indeed, Herr Boltzmann taught us that in thermal equilibrium at temperature 
T = 1/8, the probability for a configuration to occur in a classical system or 
the probability for a state to occur in a quantum system is just the Boltzmann 
factor e~?* suitably normalized, where E is to be interpreted as the energy of 
the configuration in a classical system or as the energy eigenvalue of the state in 
a quantum system. In particular, recall the classical statistical mechanics of an 
N-particle system for which 


1 
E(p. 9) =} Pi + Vd dass Qw) 
i 
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The partition function is given (up to some overall constant) by 
z=|| J dp,dqje PE? 
i 


After doing the integrals over p we are left with the (reduced) partition function 


Z= TT f aac n--0m 
i 


Promoting this to a field theory as in Chapter [.3, letting i + x and gq; > g(x) as 
before, we see that the partition function of a classical field theory with the static 
energy functional E(g) has precisely the form in (2), upon identifying the symbol 
h as the temperature T = 1/f. Thus, 


Functional integral representation of the quantum partition function 


More interestingly, we move on to quantum statistical mechanics. The integration 
over phase space {p, g} is replaced by a trace, that is, a sum over states: Thus the 
partition function of a quantum mechanical system (say of a single particle to be 
definite) with the Hamiltonian H is given by 


Z=tr e PH LY" (nle P¥ |n) 


ia 


In Chapter I.2 we worked out the integral representation of (F|e~!”7 |Z). (You 
should not confuse the time 7 with the temperature T of course.) Suppose we want 
an integral representation of the partition function. No need to do any more work! 
We simply replace the time T by —if, set |7) = |F) = |n) and sum over |n) to 
obtain 


B 
Z=tr e P4 [. Dae fy @eL(@) (4) 


Tracing the steps from (1.2.3) to (1.2.5) you can verify that here L(g) = 
1(dq/dt)* + V(q) is precisely the Lagrangian corresponding to H in the Eu- 
clidean time t. The integral over t runs from 0 to 8. The trace operation sets the 
initial and final states equal and so the functional integral should be done over all 
paths q(t) with the boundary condition g(0) = q(B). The subscript PBC reminds 
us of this all important periodic boundary condition. 

The extension to field theory is immediate. If H is the Hamiltonian of a quan- 
tum field theory in D-dimensional space [and hence d = (D + 1)-dimensional 
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spacetime], then the partition function (4) is 
B 
Z tr ev BH -[ Dye fo dt f d?xk) 65) 
PBC 


with the integral evaluated over all paths y(x, 7) such that 
g(x, 0) = (x, B) (6) 


(Here ¢ represents all the Bose fields in the theory.) 

A remarkable result indeed! To study a field theory at finite temperature all we 
have to do is rotate it to Euclidean space and impose the boundary condition (6). 
Thus, 


Euclidean quantum field theory in (D + 1)-dimensional 


spacetime, O< Tt < f (7) 
~ Quantum statistical mechanics in D-dimensional space 


In the zero temperature limit > oo we recover from (5) the standard Wick- 
rotated quantum field theory over an infinite spacetime, as we should. 

Surely you would hit it big with mystical types if you were to tell them that 
temperature is equivalent to cyclic imaginary time. At the arithmetic level this 
connection comes merely from the fact that the central objects in quantum physics 
e HT and in thermal physics e~*” are formally related by analytic continuation. 
Some physicists, myself included, feel that there may be something profound here 
that we have not quite understood. 


Finite temperature Feynman diagrams 


If we so desire, we can develop the finite temperature perturbation theory of 
(5), working out the Feynman rules and so forth. Everything goes through as 
before with one major difference stemming from the condition (6) g(x, t = 0) 
= y(X, t =f). Clearly, when we Fourier transform with the factor e'®*, the 
Euclidean frequency @ can take on only discrete values w, = (27 /B)n, with n an 
integer. The propagator of the scalar field becomes 1/(k? + k2) > 1/(@2 + k?). 
Thus, to evaluate the partition function, we simply write the relevant Euclidean 
Feynman diagrams and instead of integrating over frequency we sum over a 
discrete set of frequencies w, = (27T)n, n = —o0, --- , +00. In other words, 
after you beat a Feynman integral down to the form [ dkF (ke) all you have to 
do is replace it by 27 T ¥_, f dPkF[(Q2nT)?n? + R?], 

It is instructive to see what happens in the high temperature T — oo limit. In 
summing over w,, the n = 0 term dominates since the combination (21 T)?n? + Kk? 
occurs in the denominator. Hence, the diagrams are evaluated effectively in D- 
dimensional space. We lose a dimension! Thus, 
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Euclidean quantum field theory in D-dimensional spacetime 
~ High-temperature quantum statistical mechanics in (8) 


D-dimensional space 


This is just the statement that at high temperature quantum statistical mechanics 
goes classical [Compare (3]. 

An important application of quantum field theory at finite temperature is to 
cosmology: The early universe may be described as a soup of elementary particles 
at some high temperature. 


Hawking radiation 


Hawking radiation from black holes is surely the most striking prediction of 
gravitational physics of the last few decades. The notion of black holes goes all 
the way back to Laplace, who noted that the escape velocity from a sufficiently 
massive object may exceed the speed of light. Classically, things fall into black 
holes and that’s that. But with quantum physics a black hole can in fact radiate like 
a black body at a characteristic temperature T. 

Remarkably, with what little we learned in Chapter [1.10 and here, we can 
actually determine the black hole temperature. I hasten to add that a systematic 
development would be quite involved and fraught with subtleties; indeed, entire 
books are devoted to this subject. However, what we need to dois more or less clear. 
Starting with Chapter I.10, we would have to develop quantum field theory (for 
instance, that of a scalar field g) in curved spacetime, in particular in the presence 
of a black hole, and ask what a vacuum state (i.e., a state devoid of ¢ quanta) in the 
far past evolves into in the far future. We would find a state filled with a thermal 
distribution of g quanta. We will not do this here. 

In hindsight, people have given numerous heuristic arguments for Hawking 
radiation. Here is one. Let us look at the Schwarzschild solution (see Chapter I.10) 


M\"! 
ds*= (1 _ eM) dt? — (1 = =) dr? — r*d0? — r* sin 6 dd? (9) 
r r 


At the horizon r = 2GM, the coefficients of dt? and dr? change sign, indicating 
that time and space, and hence energy and momentum, are interchanged. Clearly, 
something strange must occur. With quantum fluctuations, particle and antiparticle 
pairs are always popping in and out of the vacuum, but normally, as we had 
discussed earlier, the uncertainty principle limits the amount of time Ar the pairs 
can exist to ~ 1/ AE. Near the black hole horizon, the situation is different. A pair 
can fluctuate out of the vacuum right at the horizon, with the particle just outside the 
horizon and the antiparticle just inside; heuristically the Heisenberg restriction on 
At may be evaded since what is meant by energy changes as we cross the horizon. 
The antiparticle falls in while the particle escapes to spatial infinity. Of course, a 
hand-waving argument like this has to be backed up by detailed calculations. 
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If black holes do indeed radiate at a definite temperature 7 , and that is far from 
obvious a priori, we can estimate T easily by dimensional analysis. From (9) we 
see that only the combination GM, which evidently has the dimension of a length, 
can come in. Since T has the dimension of mass, that is, length inverse, we can 
only have T «x 1/GM. 

To determine T precisely, we resort to a rather slick argument. I warn you from 
the outset that the argument will be slick and should be taken with a grain of salt. 
It is only meant to whet your appetite for a more correct treatment. 

Imagine quantizing a scalar field theory in the Schwarzschild metric, along the 
line described in Chapter I.10. If upon Wick rotation the field “feels” that time is 
periodic with period £, then according to what we have learned in this chapter 
the quanta of the scalar field would think that they are living in a heat bath with 
temperature T = 1/B. 

Setting t > —it, we rotate the metric to 


—1 
+ (1 _ 26M ) dr? +r°d@? +r? sin’ oad | (10) 
r 


In the region just outside the horizon r 2 2G M, we perform the general coordinate 
transformation (rt, r) > (@, R) defined by Rda =(1—2GM/r)2dt and dR = 
d_—- 2GM /r)~2dr so that the first two terms in ds? become R*da? + dR’, 
namely the length element squared of flat 2-dimensional Euclidean space in polar 
coordinates. Thus, the Euclidean time t is proportional to the polar angle a. 
Denote the period of t by 8. Since we are interested in what happens just outside 


the horizon, we need to determine (a, R) only to leading order in (r — 2GM). 
Integrating the two differential equations we obtain 


B~ (2GM)~3(r — 2GM)2B 


Rn) =a —22™)3 
r 


and 


R= f a= Ey tar'=eem? | (r' — 2GM)~ 2dr’ 
2GM r’ 2GM 


= (2GM)22(r — 2GM)?2 
Dividing, we find B = (277)2(2GM) = 8%GM and thus the Hawking temperature 


T= 1 Ac? 
 81GM 8&1GM 


(1) 


Restoring #, we see that Hawking radiation is indeed a quantum effect. 
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V.2.3. 
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Exercises 


Study the free field theory & = 1(8)? — 4m’ at finite temperature and derive 
the Bose-Einstein distribution. 


It probably does not surprise you that for fermionic fields the periodic bound- 
ary condition (6) is replaced by an antiperiodic boundary condition w(x, 0) = 
—y (x, 8) in order to reproduce the results of Chapter II.5. Prove this by looking 
at the simplest fermionic functional integral. [Hint: The clearest exposition of this 
satisfying fact may be found in Appendix A of R. Dashen, B. Hasslacher, and A. 
Neveu, Phys. Rev. D12: 2443, 1975. 


It is interesting to consider quantum field theory at finite density, as may occur in 
dense astrophysical objects or in heavy ion collisions. (In the previous chapter we 
studied a system of bosons at finite density and zero temperature.) In statistical 
mechanics we learned to go from the partition function to the grand partition 
function Z = tr e-8(4-#%), where a chemical potential jz is introduced for 
every conserved particle number N. For example, for noninteracting relativistic 
fermions, the Lagrangian is modified to L = W(i 2 — m)w + uy. Note that 
finite density, as well as finite temperature, breaks Lorentz invariance. Develop 
the subject of quantum field theory at finite density as far as you can. 


Chapter V.3 


Landau-Ginzburg Theory of 
Critical Phenomena 


The emergence of nonanalyticity 


Historically, the notion of spontaneous symmetry breaking came into particle 
physics from condensed matter physics, originating in the work of Landau and 
Ginzburg on second-order phase transitions. 

Consider a ferromagnetic material in thermal equilibrium at temperature T. 
The magnetization M (x) is defined as the average of the atomic magnetic mo- 
ments taken over a region of a size much larger than the length scale characteristic 
of the relevant microscopic physics. (In this chapter, we are discussing a nonrel- 
ativistic theory and x denotes the spatial coordinates only.) We know that at low 
temperatures, rotational invariance is spontaneously broken and that the material 
exhibits a bulk magnetization pointing in some direction. As the temperature is 
raised past some critical temperature T. the bulk magnetization suddenly disap- 
pears. We understand that with increased thermal agitation the atomic magnetic 
moments point in increasingly random directions, canceling each other out. More 
precisely, it was found experimentally that just below T. the magnetization \[M | 
vanishes as ~ (JT. — T)®, where the so-called critical exponent 6 ~ 0.37. 

This sudden change is known as a second order phase transition, an example 
of a critical phenomenon. Historically, critical phenomena presented a challenge 
to theoretical physicists. In principle, we are to compute the partition function 
Z =tr e—*/T with the microscopic Hamiltonian J, but Z is apparently smooth in 
T except possibly at T = 0. Some physicists went as far as saying that nonanalytic 
behavior such as (T. — T)? is impossible and that within experimental error |\M | 
actually vanishes as a smooth function of T. Part of the importance of Onsager’s 
famous exact solution in 1944 of the 2-dimensional! Ising model is that it settled 
this question definitively. The secret is that an infinite sum of terms each of which 
may be analytic in some variable need not be analytic in that variable. The trace 
intr e~*“/T sums over an infinite number of terms. 
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Arguing from symmetry 


In most situations, it is essentially impossible to calculate Z starting with the 
microscopic Hamiltonian. Landau and Ginzburg had the brilliant insight that the 
form of the free energy G as a function of M for a system with volume V could be 
argued from general principles. First, for M constant in x, we have by rotational 
invariance 


G=VlaM? +b(M’)* +---] (1) 


where a, b, - - - ate unknown (but expected to be smooth) functions of T. Landau 
and Ginzburg supposed that a vanishes at some temperature T.. Unless there is 
some special reason, we would expect a = a,(T — T,) + ---~ [rather than, say, 
a =a,(T ~ T.)? +--+], But you already learned in Chapter IV.1 what would 
happen. For T > T,, G is minimized at M =0, but as T drops below T,, new 
minima suddenly develop at |M | =./(-—a/2b) ~ (T. — T)2 . Rotational symmetry 
is spontaneously broken, and the mysterious nonanalytic behavior pops out easily. 

To include the possibility of M varying in space, Landau and Ginzburg argued 
that G must have the form 


G= J d?x{8,M9,M + aM? + b(M?)? +--+} (2) 


where the coefficient of the (a,M )? term has been set to 1 by rescaling M. You 
would recognize (2) as the Euclidean version of the scalar field theory we have 
been studying. By dimensional analysis we see that 1/,/a sets the length scale. 
More precisely, for T > T,, let us turn on a perturbing external magnetic field 
H(x) by adding the term —H-M. Assuming M small and minimizing G we 
obtain (—d* + a)M ~ H, with the solution 


ak eik- (x—- y) _ 
M 3 y[ -_____H 
O= | | Oa Bea ” 


1 _ Ales) 2 


[Recall that we did the integral in (1.4.7)—admire the unity of physics!] 

It is standard to define a correlation function < M (x)M (0) > by asking what 
the magnetization M (x) will be if we use a magnetic field sharply localized at the 
origin to create a magnetization M (0) there. We expect the correlation function 
to die off as e~!/§ over some correlation length € that goes to infinity as T 
approaches T. from above. The critical exponent v is traditionally defined by 
g ~ V(t ~ T,)”. 

In Landau-Ginzburg theory, also known as mean field theory, we obtain § = 
1/./a and hence v = }. 

The important point is not how well the predicted critical exponents such as B 
and v agree with experiment but how easily they emerge from Landau-Ginzburg 
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theory. The theory provides a starting point for a complete theory of critical 
phenomena, which was eventually developed by Kadanoff, Fisher, Wilson, and 
others using the renormalization group (to be discussed in Chapter VI.8). 

The story goes that Landau had a logarithmic scale with which he ranked 
theoretical physicists, with Einstein on top, and that after working out Landau- 
Ginzburg theory he moved himself up by half a notch. 


Exercise 


V.3.1. Another important critical exponent y is defined by saying that the susceptibility 
xX =(0M/0H)|yW=5 diverges ~ 1/{T — T.|” as T approaches T.. Determine y 
in Landau-Ginzburg theory. [Hint: Instructively, there are two ways of doing it: 
(a) Add —H-M to (1) for M and H constant in space and solve for M(H). 
(b) Calculate the susceptibility function x;;(x — y) = [0M;(x)/H;(y)]l y= and 
integrate over space.] 


Chapter V.4 


Superconductivity 


Pairing and condensation 


When certain materials are cooled below a certain critical temperature 7, they 
suddenly become superconducting. Historically, physicists had long suspected that 
the superconducting transition, just like the superfluid transition, has something to 
do with Bose-Einstein condensation. But electrons are fermions, not bosons, and 
thus they first have to pair into bosons, which then condense. We now know that 
this general picture is substantially correct: Electrons form Cooper pairs, whose 
condensation is responsible for superconductivity. 

With brilliant insight, Landau and Ginzburg realized that without having to 
know the detailed mechanism driving the pairing of electrons into bosons, they 
could understand a great deal about superconductivity by studying the field g(x) 
associated with these condensing bosons. In analogy with the ferromagnetic tran- 
sition in which the magnetization M(x) ina ferromagnet suddenly changes from 
zero to a nonzero value when the temperature drops below some critical temper- 
ature, they proposed that g(x} becomes nonzero for temperatures below T,. (In 
this chapter x denotes spatial coordinates only.) In statistical physics, quantities 
such as M (x) and g(x) that change through a phase transition are known as order 
parameters. 

The field g(x) carries two units of electric charge and is therefore complex. The 
discussion now unfolds much as in Chapter V.3 except that 8,¢ should be replaced 
by D;y = (0; — i2eA;)¢@ since ¢ is charged. Following Landau and Ginzburg and 
including the energy of the external magnetic field, we write the free energy as 


1 b 
F =F; + Dip? +alel + Sieh + (1) 


which is clearly invariant under the U(1) gauge transformation g > e'?¢4g and 
A; > A; + 0;A. As before, setting the coefficient of |D;|* equal to 1 just amounts 
to a normalization choice for ¢. 


The similarity between (1) and (IV.6.1) should be evident. 
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Meissner effect 


A hallmark of superconductivity is the Meissner effect, in which an external 
magnetic field B permeating the material is expelled from it as the temperature 
drops below T... This indicates that a constant magnetic field inside the material is 
not favored energetically. The effective laws of electromagnetism in the material 
must somehow change at T.. Normally, a constant magnetic field would cost an 
energy of the order ~ BV, where V is the volume of the material. Suppose that the 
energy density is changed from the standard B* to A? (where as usual V x A=B). 
For a constant magnetic field B,A grows as the distance and hence the total energy 
would grow faster than V. After the material goes superconducting, we have to pay 
an unacceptably large amount of extra energy to maintain the constant magnetic 
field and so it is more favorable to expel the magnetic field. 

Note that a term like A? in the effective energy density preserves rotational 
and translational invariance but violates electromagnetic gauge invariance. But we 
already know how to break gauge invariance from Chapter IV.6. Indeed, the (1) 
gauge theory described there and the theory of superconductivity described here 
are essentially the same, related by a Wick rotation. 

As in Chapter V.3 we suppose that for temperature T ~ T., a ~ a\(T — T.) 
while b remains positive. The free energy J is minimized by g = 0 above T,., and 
by |g| = /—a/b = v below T.. All this is old hat to you, who have learned that 
upon symmetry breaking in a gauge theory the gauge field gains a mass. We simply 
read off from (1) that 


F = F;, + (2ev)? A? +--+: (2) 


which is precisely what we need to explain the Meissner effect. 


London penetration length and coherence length 


Physically, the magnetic field does not drop precipitously from some nonzero 
value outside the superconductor to zero inside, but drops over some characteristic 
length scale, called the London penetration length. The magnetic field leaks into 
the superconductor a bit over a length scale /, determined by the competition 
between the energy in the magnetic field Fe ~ (@A)* ~ A7/I* and the Meissner 
term (2ev)”A” in (2). Thus, Landau and Ginzburg obtained the London penetration 
length 1; ~ (1/ev) = (1/e)./b/—a. 

Similarly, the characteristic length scale over which the order parameter ¢ varies 
is known as the coherence length /,, which can be estimated by balancing the 
second and third terms in (1), roughly (8g)? ~ g?/ Ie and ag” against each other, 
giving a coherence length of order |, ~ 1/./—a. 

Putting things together, we have 


1, Mb 3) 
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You might recognize from Chapter IV.6 that this is just the ratio of the mass of the 
scalar field to the mass of the vector field. 

As Iremarked earlier, the concept of spontaneous symmetry breaking went from 
condensed matter physics to particle physics. After hearing a talk at the University 
of Chicago on the Bardeen-Cooper-Schrieffer theory of superconductivity by 
the young Schrieffer, Nambu played an influential role in bringing spontaneous 
symmetry breaking to the particle physics community. 


Exercises 


V.4.1. Vary (1) to obtain the equation for A and determine the London penetration length 
more carefully. 


V.4.2. Determine the coherence length more carefully. 


Chapter V.5 
Peierls Instability 


Noninteracting hopping electrons 


The appearance of the Dirac equation and a relativistic field theory in a solid would 
be surprising indeed, but yes, it is possible. 
Consider the Hamiltonian 


H=-t Yoel ye; topes (1) 
j 


describing noninteracting electrons hopping on a 1-dimensional lattice (Figure 
V.5.1). Here c; annihilates an electron on site j. Thus, the first term describes an 
electron hopping from site j to site 7 + 1 with amplitude t. We have suppressed 
the spin labels. This is just about the simplest solid state model; a good place to 
read about it is in Feynman’s “Freshman lectures.” 

Fourier transforming c; = )°; e'*4j c(k) (where a is the spacing between sites), 
we immediately find the energy spectrum e(k) = —2t cos ka (Fig. V.5.2). Imposing 
a periodic boundary condition on a lattice with N sites, we have k = (22 /Na)n 
with n an integer from —5N to 3N. As N — oo, k becomes a continuous rather 
than a discrete variable. As usual, the Brillouin zone is defined by —2/a <k < 
n/a. 

There is absolutely nothing relativistic about any of this. Indeed, at the bottom 
of the spectrum the energy (up to an irrelevant additive constant) goes as e(k) ~ 
2t} (ka)? = k?/2mes¢. The electron disperses nonrelativistically with an effective 
Mass Moff. 

But now let us fill the system with electrons up to some Fermi energy ¢; (see 
fig. (2).) Focus on an electron near the Fermi surface and measure its energy from 
€, and momentum from +k,,. Suppose we are interested in electrons with energy 
small compared to €,-, thatis, E = € — ¢; < ¢,, and momentum small compared 
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€(k) 


q|a 
9|a 


Figure V.5.2 


tok,, thatis, p =k —k; <k,. These electrons obey a linear energy-momentum 
dispersion E = vp with the Fermi velocity vp = (0€/0k)|,—-;,,- We will call the 
field associated with these electrons yp, where the subscript indicates that they 
are “right moving.” It satisfies the equation of motion (0/dt + v-0/dx)Wp = 0. 

Similarly, the electrons with momentum around —k, obey the dispersion E = 
—v,p. We will call the field associated with these electrons yw, with L for “left 
moving,” satisfying (0/dt + v,-d/dx)w, =0. 


Emergence of the Dirac equation 


The Lagrangian summarizing all this is simply 


0 
o=ivh (2 + Up out id (2 - ne) Wr (2) 


Introducing a 2-component field yy = ( YE ) ,v=viy® = w'o,, and choosing 
units so that vp = 1, we may write £ more compactly as 


~jyif{%® 6% = ive 
Lip (: aw) y Wi" 9, (3) 


with y° = a, and y! = ia; satisfying the Clifford algebra {y”, y”} = 2g”. 
Amazingly enough, the (1 + 1)-dimensional Dirac Lagrangian emerges in a 
totally nonrelativistic situation! 
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An instability 


I will now go on to discuss an important phenomenon known as Peierls’s instability. 
I will necessarily have to be a bit sketchy. I don’t have to tell you again that this is 
not a text on solid state physics, but in any case you will not find it difficult to fill 
in the gaps. 

Peierls considered a distortion in the lattice, with the ion at site j displaced 
from its equilibrium position by cos[q(ja)]. (Shades of our mattress from Chapter 
1.2!) A lattice distortion with wave vector g = 2k, will connect electrons with 
momentum k; with electrons with momentum —k,. In other words, it connects 
right moving ones with left moving electrons, or in our field theoretic language 
Wp with w,. Since the right moving electrons and the left moving electrons on 
the surface of the Fermi sea have the same energy (namely ¢,, duh!) we have the 
always interesting situation of degenerate perturbation theory: ( “% + ? 3 
with eigenvalues ¢, + 6. A gap opens at the surface of the Fermi sea. Here 6 
represents the perturbation. Thus, Peierls concluded that the spectrum changes 
drastically and the system is unstable under a perturbation with wave vector 2k;. 

A particularly interesting situation occurs when the system is half filled with 
electrons (so that the density is one electron per site—recall that electrons have 
up and down spin). In other words, k; = 2/2a and thus 2k, = /a. A lattice 
distortion of the form shown in Fig. V.5.3 has precisely this wave vector. Peierls 
showed that a half-filled system would want to distort the lattice in this way, 
doubling the unit cell. It is instructive to see how this physical phenomenon 
emerges in a field theoretic formulation. 

Denote the displacement of the ion at site 7 by d;. In the continuum limit, we 
should be able to replace d; by a scalar field. Show that a perturbation connecting 
Wpand wy, couples to yy and yyw, and that a linear combination yy and Py *w 
can always be rotated to yy by a chiral transformation (see Exercise V.5.1.) Thus, 
we extend (3) to 


L=piy"d,e + $1(8,9)° — 00,9) 1- sue + gov t-+. (4) 


Remember that you worked out the effective potential V.g-(g) of this (1 + 1)- 
dimensional field theory in exercise (IV.3.2): Ver (@) goes as v7 log y? for small ¢, 
which overwhelms the 422? term. Thus, the symmetry ¢ > —q is dynamically 
broken. The field g acquires a vacuum expectation value and y becomes massive. 
In other words, the electron spectrum develops a gap. 
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Exercise 


V.5.1. Parallel to the discussion in Chapter [1.1 you can see easily that the space of 2 by 
2 matrices is spanned by the four matrices J, y“, and y> = y°y! = 43. (Note the 
peculiar but standard notation of y*.) Convince yourself that 3 + y>) projects 
out right- and left handed fields just as in (3 + 1)-dimensional spacetime. Show 
that in the bilinear yyy left handed fields are connected to left handed fields 
and right handed to right handed and that in the scalar yy and the pseudoscalar 
wy right handed is connected to left handed and vice versa. Finally, note that 
under the transformation yy > e? ory the scalar and the pseudoscalar rotate into 
each other. Check that this transformation leaves the massless Dirac Lagrangian 
(3) invariant. 


Chapter V.6 


Solitons 


Breaking the shackles of Feynman diagrams 


When I teach quantum field theory I like to tell the students that by the mid-1970s 
field theorists were breaking the shackles of Feynman diagrams. A bit melodra- 
matic, yes, but by that time Feynman diagrams, because of their spectacular suc- 
cesses in quantum electrodynamics, were dominating the thinking of many field 
theorists, perhaps to excess. As a student I was even told that Feynman diagrams 
define quantum field theory, that quantum fields were merely the “slices of veni- 
son”! used to derive the Feynman rules, and should be discarded once the rules 
were obtained. The prevailing view was that it barely made sense to write down 
p(x). This view was forever shattered with the discovery of topological solitons, 
as we will now discuss. 


Small oscillations versus lumps 


Consider once again our favorite toy model £ = !(4y)* — V(g) with the infamous 
double-well potential V(~) = (A/4)(g? — v?)? in (1 + 1)-dimensional spacetime. 
In Chapter IV.1 we learned that of the two vacua yg = +v we are to pick one and 
study small oscillations around it. So, pick one and write g = v + x, expand £ in 


x, and study the dynamics of the x meson with mass pz = (Av2)2. Physics then 
consists of suitably quantized waves oscillating about the vacuum v. 

But that is not the whole story. We can also have a time independent field 
configuration with g(x) (in this and the next chapter x will denote only space 
unless it is clearly meant to be otherwise from the context) taking on the value 
—v asx — —oo and+v as x > +00, and changing from —v to +v around some 
point xg over some length scale / as shown in Figure V.6.1a. [Note that if we 
consider the Euclidean version of the field theory, identify the time coordinate as 
the y coordinate, and think of g(x, y) as the magnetization (as in Chapter V.4), 


‘Gell-Mann used to speak about how pheasant meat is cooked in France between two 
slices of venison which are then discarded. He forcefully advocated a program to extract 
and study the algebraic structure of quantum field theories which are then discarded. 
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Figure V.6.1 


then the configuration here describes a “domain wall” in a 2-dimensional magnetic 


system.] 
Think about the energy per unit length 
s(x) = = (<2) +4g-vy (1) 
2 \dx 4 ? 


for this configuration, which I plot in Figure V.6.1b. Far away from xg we are in one 
of the two vacua and there is no energy density. Near xg, the two terms in €(x) both 
contribute to the energy or mass M = f{ dx €(x): the “spatial variation” (in a slight 
abuse of terminology often called the “kinetic energy”) term f dx4(dy/dx)* ~ 
I(v/1)? ~ v?/1, and the “potential energy” term f dxrA(g? - vy’)? ~ Av. To 
minimize the total energy the spatial variation term wants / to be large, while the 
potential term wants / to be small. The competition dM /dl =0 gives v?/1 ~ lv’, 
thus fixing / ~ (Av2)~2 ~ 1/2. The mass comes out to be ~ pv? ~ pu(w2/A). 
We have a lump of energy spread over a region of length / of the order of the 
Compton wavelength of the x meson. By translation invariance, the center of the 
lump x9 can be anywhere. Furthermore, since the theory is Lorentz invariant, we 
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can always boost to send the lump moving at any velocity we like. Recalling a 
famous retort in the annals of American politics (“It walks like a duck, quacks like 
a duck, so Mr. Senator, why don’t you want to call it a duck?”) we have here a 
particle, known as a kink or a soliton, with mass ~ p(y? /A) and size ~ I. Perhaps 
because of the way the soliton was discovered, many physicists think of it as a big 
lumbering object, but as we have seen, the size of a soliton ] ~ 1/2 can be made 
as small as we like by increasing jz. So a soliton could look like a point particle. 
We will come back to this point in Chapter VI.3 when we discuss duality. 


Topological stability 


While the kink and the meson are the same size, for small 1 the kink is much more 
massive than the meson. Nevertheless, the kink cannot decay into mesons because 
it costs an infinite amount of energy to undo the kink [by “lifting” g(x) over the 
potential energy barrier to change it from +v to —v for x from some point = xg 
to +00, for example.] The kink is said to be topologically stable. 

The stability is formally guaranteed by the conserved current 


Jt= ap (2) 


with the charge 


+00 1 
Q -/ dx J°(x) = ay eit) — 9(—o0)] 


Mesons, which are small localized packets of oscillations in the field clearly have 
Q =0, while the kink has Q = 1. Thus, the kink cannot decay into a bunch of 
mesons. Incidentally, the charge density J° = (1/2v)(dg/dx) is concentrated at 
Xq Where g changes most rapidly, as you would expect. 

Note that 3,,/“ = 0 follows immediately from the antisymmetric symbol e*” 
and does not depend on the equation of motion. The current J“ is known as 
a “topological current.” Its existence does not follow from Noether’s theorem 
(Chapter I.9) but from topology. 

Our discussion also makes clear the existence of an antikink with Q = —1 and 
described by a configuration with g(—oo) = +v and g(++00) = —v. The name is 
justified by considering the configuration pictured in Figure V.6.2 containing a kink 
and an antikink far apart. As the kink and the antikink move closer to each other, 
they clearly can annihilate into mesons, since the configuration shown in Figure 
V.6.2 and the vacuum configuration with g(x) = +v everywhere are separated by 
a finite amount of energy. 
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Antikink Kink 


Figure V.6.2 


A nonperturbative phenomenon 


That the mass of the kink comes out inversely proportional to the coupling A is 
a clear sign that field theorists could have done perturbation theory in A. till they 
were blue in the face without ever discovering the kink. Feynman diagrams could 
not have told us about it. 

You can calculate the mass of a kink by minimizing 


w-fal(E) hem) 
-(E)e fol (B ir-y) 


where in the last step we performed the obvious scaling g(x) > uf (y) and y = 
px. This scaling argument immediately showed that the mass of the kink M = 
a(u?/A) with a a pure number: The heuristic estimate of the mass proved to 


be highly trustworthy. The actual function g(x) and hence a can be computed 
straightforwardly with standard variational methods. 


Bogomol’nyi inequality 


More cleverly, observe that the energy density (1) is the sum of two squares. Using 
a* + b* > 2iab| we obtain 


mz f ax(% : (2 (v -v) = (4) 13 _ yy? re 
= 2) (ax) \” =(5) [3% PP | 
4 ) 
-|-.u(#) 0 
sat (5 
We have the elegant result 


M > |Q| (3) 


V.6. Solitons 281 


with mass M measured in units of (4/3/2)4:(7/A). This is an example of a 
Bogomol’nyi inequality, which plays an important role in string theory. 
Exercises 


V.6.1. Show that if g(x) is a solution of the equation of motion, then so is g[(x — vt)/ 


V1 — v2]. 


V.6.2. Discuss the solitons in the so-called sine-Gordon theory & = 1(ag)’— 
g cos(B¢g). Find the topological current. Is the Q = 2 soliton stable or not? 


V.6.3. Compute the mass of the kink by the brute force method and check the result from 
the Bogomol’nyi inequality. 


Chapter V.7 


Vortices, Monopoles, and Instantons 


Vortices 


The kink is merely the simplest example of a large class of topological objects in 
quantum field theory. 

Consider the theory of a complex scalar field in (2 + 1)-dimensional spacetime 
L= ay dy — Aye — v)? with the now familiar Mexican hat potential. With 
some minor changes in notation, this is the theory we used to describe interacting 
bosons and superfluids. (We choose to study the relativistic rather than the non- 
relativistic version but as you will see the issue does not enter for the questions I 
want to discuss here.) 

Are there solitons, that is, objects like the kink, in this theory? 

Given some time-independent configuration g(x) let us look at its mass or 
energy 


M= il d?x[8,9'8,9 + A(@'g — v)?}. (1) 


The integrand is a sum of two squares, each of which must give a finite contribution. 
In particular, for the contribution of the second term to be finite the magnitude of 
y must approach v at spatial infinity. 

This finite energy requirement does not fix the phase of g however. Using polar 
coordinates (r, #) we will consider the Ansatz ¢ —> ve’, Writing g = 9g, + i>, 
we see that the vector (vy), ¢)) = u(cos @, sin @) points radially outward at infinity. 
Recall the definition of the current J; = i (3,¢'y — v*9,@) in a bosonic fluid given 
in Chapter II].5. The flow whirls about at spatial infinity, and thus this configuration 
is known as the vortex. 

By explicit differentiation or dimensional analysis, we have 0;y ~ v(1/r) as 
r —> oo. Now look at the first term in 4. Oops, the energy diverges logarithmically 
as u* f d?x(1/r’). 

Is there a way out? Not unless we change the theory. 
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Vortex into flux tubes 


Suppose we gauge the theory by replacing 0,9 by D;g = 0,9 — ieA;p. Now we can 
achieve finite energy by requiring that the two terms in D;g knock each other out so 
that D;p 2,9 faster than 1/r. In other words, A; = ~(i/e)(1/|e|)¢'8;9 = 
(1/e)0,@. Immediately, we have 


Flux = / @xF i= § dx; A; = on (2) 
Cc é 


where C is an infinitely large circle at spatial infinity and we have used Stoke’s 
theorem. Thus, in a gauged U (1) theory the vortex carries a magnetic flux inversely 
proportional to the charge. When I say magnetic, I am presuming that A represents 
the electromagnetic gauge potential. The vortex discussed here appears as a flux 
tube in so-called type II superconductors. It is worth remarking that this fundamen- 
tal unit of flux (2) is normally written in the condensed matter physics literature in 
unnatural units as 


Oy = 7 (3) 


very pleasingly uniting three fundamental constants of Nature. 


Homotopy groups 


Since spatial infinity in 2 dimensional space is topologically a unit circle $ ' and 
since the field configuration with |g| = v also forms a circle S', this boundary 
condition can be characterized as a map S!— S!. Since this map cannot be 
smoothly deformed into the trivial map in which S! is mapped onto a point in 
S', the corresponding field configuration is indeed topologically stable. (Think of 
wrapping a loop of string around a ring.) 

Mathematically, maps of S” into a manifold M are classified by the homotopy 
group IT,,(M), which counts the number of topologically inequivalent maps. You 
can look up the homotopy groups for various manifolds in tables.! In particular, for 
n > 1,TI,,(S") = Z, where Z is the mathematical notation for the set of all integers. 
The simplest example IT ,(S') = Z is proved almost immediately by exhibiting the 
Maps ~ —>,_,o5 ve’’”®, with m any integer (positive or negative), using the context 
and notation of our discussion for convenience. Clearly, this map wraps one circle 
around the other m times. 

The language of homotopy groups is not just to impress people, but gives us a 
unifying language to discuss topological solitons. Indeed, looking back you can 


! See tables 6.V and 6.VI, S. lyanaga and Y. Kawada, eds., Encyclopedic Dictionary of 
Mathematics, p. 1415. 
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now see that the kink is a physical manifestation of TI9(S°) = Z, (since the 0- 
dimensional sphere S° = {+1, —1} consists of just two points and is topologically 
equivalent to the spatial infinity in 1-dimensional space.) 


Hedgehogs and monopoles 


If you absorbed all this, you are ready to move up to (3 + 1)-dimensional spacetime. 
Spatial infinity is now topologically S*. By now you realize that if the scalar field 
lives on the manifold M, then we have at infinity the map of S 2_, M.The simplest 
choice is thus to take S* for M. Hence, we are led to scalar fields yg? (a = 1, 2, 3) 
transforming as a vector ¢ under an internal symmetry group O(3) and governed 
by £ = 10g - 99 — V(@- 9). (There should be no confusion in using the arrow to 
indicate a vector in the internal symmetry group.) 

Let us choose V = A(¢* — v*)*. The story unfolds much as the story of the 
vortex. The requirement that the mass of a time independent configuration 


M= J d?x{1 (ag)? 4a@ - v9 (4) 


be finite forces |¢| = v at spatial infinity so that g(r = 00) indeed lives on S”. 

The identity map S* — S? indicates that we should consider a configuration 
such that 

a 
gy! > (5) 

roo or 
This equation looks a bit strange at first sight since it mixes the index of the internal 
symmetry group with the index of the spatial coordinates (but in fact we have 
already encountered this phenomenon in the vortex). At spatial infinity, the field 
¢ is pointing radially outward, so this configuration is known picturesquely as a 

hedgehog. Draw a picture if you don’t get it! 

As in the vortex story, the requirement that the first term in (4) be finite forces 
us to introduce an O(3) gauge potential A’ so that we can replace the ordinary 
derivative 0;,¢° by the covariant derivative D,y* = 3,g% + 264 APye, We can then 
arrange D;g* to vanish at infinity. Simple arithmetic shows that with (5) the gauge 
potential has to go as 


(6) 


Imagine yourself in a lab at spatial infinity. Inside a small enough lab, the ¢ 
field at different points are all pointing in approximately the same direction. The 
gauge group O (3) is broken down to O(2) ~ U (1). The experimentalists in this lab 
observe a massless gauge field associated with the U (1), which they might as well 
call the electromagnetic field (“quacks like a duck”). Indeed, the gauge invariant 
tensor field 
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Fie — £7 2(D_p)?(Dyp)* 
w= -_ —— ? (7) 


lvl elp/? 


can be identified as the electromagnetic field (see Exercise V.7.5). 

There is no electric field since the configuration i is time independent and A? = 0. 
We can only have a magnetic field B which you can | immediately calculate since 
you know A, but by symmetry we already see that B can point only in the radial 
direction. 

This is the fabled magnetic monopole first postulated by Dirac! 

The presence of magnetic monopoles in spontaneously broken gauge theory 
was discovered by ’t Hooft and Polyakov. If you calculate the total magnetic flux 
coming out of the monopole f d S - B, where as usual d § denotes a small surface 
element at infinity pointing radially outward, you will find that it is quantized in 
suitable units, exactly as Dirac had stated, as it must (recall Chapter IV.4). 

We can once again write a Bogomo!’nyi inequality for the mass of the monopole 


m= | ax [1G +100" +V@| (8) 
LF; ; transforms as a vector under O(3); recall ([V.5.17).] Observe that 


IF.) + 4(D,@) = 4F; + 8;;,D,6)° ¥ 5 8:4. Fij » Dy 


Thus 
M> | dx [Flank DO+VO| @) 
We next note that 
[ex Bin F ij * Db= f Px} £54. (F ij Gav | dB =anvg 


has an elegant interpretation in terms of the magnetic charge g of the monopole. 
Furthermore, if we can throw away V(¢) while keeping |¢@| = then the 


inequality M > 47v|g| is saturated by Fi; = +6;; -~D,@. The solutions of this 
equation are known as Bogomol’nyi-Prasad- Sommerfeld or BPS states. 

Itis not difficult to construct an electrically charged magnetic monopole, known 
as adyon. We simply take Ab = (x°/r) f(r) with some suitable function f(r). 

One nice feature of the topological monopole is that its mass comes out to 
be ~ My/a ~ 137My (Exercise V.7.11), where My denotes the mass of the 
intermediate vector boson of the weak interaction. We are anticipating Chapter 
VII.2 a bit in that the gauge boson that becomes massive by the Anderson-Higgs 
mechanism of Chapter IV.6 may be identified with the intermediate vector boson. 
This explains naturally why the monopole has not yet been discovered. 
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Instanton 


Consider a nonabelian gauge theory, and rotate the path integral Z = f DAe~*“) 
to 4-dimensional Euclidean space. We might wish to evaluate Z in the steepest 
descent approximation, in which case we would have to find the extrema of 


I 
4 
(A) = f anu FasFy 


with finite action. This implies that at infinity |x| = oo, F,,, must vanish faster 
than 1/|x|, and so the gauge potential A » Must be a pure gauge: A = gdg' for 
g an element of the gauge group [see (IV.5.6)]. Configurations for which this is 
true are known as instantons. We see that the instanton is yet one more link in the 
“great chain of being”: kink-vortex-monopole-instanton. 

Choose the gauge group SU(2) to be definite. In the parametrization g = 
X4+ix -& wehave by definition g'g = | and det g = 1, thus implying xf + X? = 1. 
We learn that the group manifold of SU (2) is 5°. Thus, in an instanton, the gauge 
potential at infinity A > ),)_,.. gdg’ + O(1/|x|*) defines a map 5° — S°. Sound 
familiar? Indeed, you’ve already seen S° > $°, S'— $!, S? + S? playing a role 
in field theory. 

Recall from Chapter IV.5 that tr F? = d tr(Ad A + 2.A%). Thus 


J tr F? = i tr(AdA + $A°) = i tr(AF — $A°) = —3 i tr(gdg*)?(10) 
3 8 3 


where we used the fact that F vanishes at infinity. This shows explicitly that f tr F 2 
depends only on the homotopy of the map S* —> S$? defined by g and is thus a 
topological quantity. Incidentally, f,; tr(gdg*)* is known to mathematicians as 
the Pontryargin index (see exercise V.7.12). 

I mentioned in Chapter IV.7 that the chiral anomaly is not affected by higher- 
order quantum fluctuations. You are now in position to give an elegant topological 
proof of this fact (Exercise V.7.13). 


Kosterlitz-Thouless transition 


We were a bit hasty in dismissing the vortex in the nongauged theory £ = dp ay — 
14¢'¢ — v*)? in (2 + 1)-dimensional spacetime. Around a vortex g ~ ve!® and so it 
is true, as we have noted, that the energy of a single vortex diverges logarithmically. 
But what about a vortex paired with an antivortex? 

Picture a vortex and an antivortex separated by a distance R large compared 
to the distance scales in the theory. Around an antivortex g ~ ve~'?. The field g 
winds around the vortex one way and around the antivortex the other way. Convince 
yourself by drawing a picture that at spatial infinity g does not wind at all: It just 
goes to a fixed value. The winding one way cancels the winding the other way. 
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Thus, a configuration consisting of a vortex-antivortex pair does not cost infinite 
energy. But it does cost a finite amount of energy: In the region between the 
vortex and the antivortex g is winding around, in fact roughly twice as fast 
(as you can see by drawing a picture). A rough estimate of the energy is thus 
v2 f d?x(1/r7) ~ v? log(R/a), where we integrated over a region of size R, the 
relevant physical scale in the problem. (To make sense of the problem we divide 
R by the size a of the vortex.) The vortex and the antivortex attract each other 
with a logarithmic potential. In other words, the configuration cannot be static: 
The vortex and the antivortex want to get together and annihilate each other in a 
fiery embrace and release that finite amount of energy v* log(R/a). (Hence the 
term antivortex.) 

All of this is at zero temperature, but in condensed matter physics we are 
interested in the free energy F = E — T S (with S the entropy) at some temperature 
T rather than the energy E. Appreciating this elementary point, Kosterlitz and 
Thouless discovered a phase transition as the temperature is raised. Consider a 
gas of vortices and antivortices at some nonzero temperature. Because of thermal 
agitation, the vortices and antivortices moving around may or may not find each 
other to annihilate. How high do we have to crank up the temperature for this to 
happen? 

Let us do a heuristic estimate. Consider a single vortex. Herr Boltzmann tells 
us that the entropy is the logarithm of the “number” of ways in which we can 
put the vortex inside a box of size L (which we will let tend to infinity). Thus, 
S ~ log(L/a). The entropy S is to battle the energy E ~ v” log(L/a). We see 
that the free energy F ~ (v* — T) log(L/a) goes to infinity if T < v2, which we 
identify as essentially the critical temperature T,. 

A single vortex cannot exist below T.. Vortices and antivortices are tightly 
bound below T, but are liberated above T,. 


Black hole 


The discovery in the 1970s of these topological objects that cannot be seen in 
perturbation theory came as a shock to the generation of physicists raised on 
Feynman diagrams and canonical quantization. People (including yours truly) 
were taught that the field operator g(x) is a highly singular quantum operator 
and has no physical meaning as such, and that quantum field theory is defined 
perturbatively by Feynman diagrams. Even quite eminent physicists asked in 
puzzlement what a statement such as = ve'® would mean. Learned discussions 
that in hindsight are totally irrelevant ensued. As I said in introducin g Chapter V.6, 
I like to refer to this historical process as “field theorists breaking the shackles of 
Feynman diagrams.” 

It is worth mentioning one argument physicists at that time used to convince 
themselves that solitons do exist. After all, the Schwarzschild black hole, defined 
by the metric g,,,,(x) (see Chapter I.10), had been known since 1916. Just what are 
the components of the metric g,,,,(x)? They are fields in exactly the same way that 
our scalar field g(x) and our gauge potential A,,(x) are fields, and in a quantum 
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theory of gravity g,,, would have to be replaced by a quantum operator just like g 
and A,,. So the objects discovered in the 1970s are conceptually no different from 
the black hole known in the 1910s. But in the early 1970s most particle theorists 
were not particularly aware of quantum gravity. 


V.7.1. 


V7.2. 
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V.7.6. 


V7.7. 


V7.8. 


V.7.9. 


V.7.10. 


V7.1. 


V.7.12. 


. Consider the vortex configuration in which g —> ve 


Exercises 


Explain the relation between the mathematical statement Ig(S°) = Z and the 
physical result that there are no kinks with |Q| > 2. 


In the vortex, study the length scales characterizing the variation of the fields ¢ 
and A. Estimate the mass of the vortex. 

iv? with v an integer. 
Calculate the magnetic flux. Show that the magnetic flux coming out of an 
antivortex (for which v = —1) is opposite to the magnetic flux coming out of 
a vortex. 

Mathematically, since g(@) = e'”? may be regarded as an element of the group 
U(1), we can speak of a map of S', the circle at spatial infinity, onto the group 
U(1). Calculate (i/27r) fo, gdg’, thus showing that the winding number is given 
by this integral of a t-form. 


Show that within a region in which * is constant, F,,,, as defined in the text is 
the electromagnetic field strength. Compute B far from the center of a magnetic 
monopole and show that Dirac quantization holds. 


Display explicitly the map S? + S?, which wraps one sphere around the other 
twice. Verify that this map corresponds to a magnetic monopole with magnetic 
charge 2. 

Write down the variational equations that minimize (8). 

Find the BPS solution explicitly. 

Discuss the dyon solution. Work it out in the BPS limit. 

Verify explicitly that the magnetic monopole is rotation invariant in spite of 
appearances. By this is meant that all physical gauge invariant quantities such 


as B are covariant under rotation. Gauge variant quantities such as A? can and do 
vary under rotation. Write down the generators of rotation. 


Show that the mass of the magnetic monopole is about 137My. 


Evaluate n = —(1/2477) f,, tr(gdg')? for the map g = FF (Hint: By sym- 
metry, you need calculate the integrand only in a small neighborhood of the 
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identity element of the group or equivalently the north pole of S*. Next, consider 
g =e! 101+ 0202+ 6303) for m an integer and convince yourself that m measures the 
number of times S? wraps around S?.] Compare with Exercise V.7.4 and admire 
the elegance of mathematics. 


Prove that higher order corrections do not change the chiral anomaly a, J = 
[1/(47r)*Je"*° tr FF, (1 have rescaled A — (1/g)A). (Hint: Integrate over 
spacetime and show that the left hand side is given by the number of right moving 
fermion quanta minus the number of left moving fermion quanta, so that both sides 
are given by integers. ] 


PART VI 
FIELD THEORY 


AND 
CONDENSED MATTER 


Chapter VI.1 


Fractional Statistics, Chern-Simons Term, 
and Topological Field Theory 


Fractional statistics 


The existence of bosons and fermions represents one of the most profound features 
of quantum physics. When we interchange two identical quantum particles, the 
wave function acquires a factor of either +1 or —1. Leinaas and Myrheim, and later 
Wilczek independently, had the insight to recognize that in (2 + 1)-dimensional 
spacetime particles can also obey statistics other than Bose or Fermi statistics, 
a Statistics now known as fractional or anyon statistics. These particles are now 
known as anyons. 

To interchange two particles, we can move one of them half-way around the 
other and then translate both of them appropriately. When you take one anyon 
half-way around another anyon going anticlockwise, the wave function acquires 
a factor of e? where @ is a real number characteristic of the particle. For 6 = 0, 
we have bosons and for @ = 7, fermions. Particles half-way between bosons and 
fermions, with @ = 7/2, are known as semions. 

After Wilczek’s paper came out, a number of distinguished senior physicists 
were thoroughly confused. Thinking in terms of Schrédinger’s wave function, 
they got into endless arguments about whether the wave function must be single 
valued. Indeed, anyon statistics provides a striking example of the fact that the 
path integral formalism is sometimes significantly more transparent. The concept 
of anyon statistics can be formulated in terms of wave functions but it requires 
thinking clearly about the configuration space over which the wave function is 
defined. 

Consider two indistinguishable particles at positions xi and x5 at some initial 
time that end up at positions xf and xf a time T later. In the path integral rep- 
resentation for (af , xf [efAT xi, x5) we have to sum over all paths. In spacetime, 
the worldlines of the two particles braid around each other (see Fig. VI.1.1). (We 
are implicitly assuming that the particles cannot go through each other, which is 
the case if there is a hard core repulsion between them.) Clearly, the paths can be 
divided into topologically distinct classes, characterized by an integer n equal to 
the number of times the worldlines of the two particles braid around each other. 
Since the classes cannot be deformed into each other, the corresponding amplitudes 


293 


294 VI. Field Theory and Condensed Matter 


yD 


Figure VL1.1 


cannot interfere quantum mechanically, and with the amplitudes in each class we 
are allowed to associate an additional phase factor e@» beyond the usual factor 
coming from the action. 

The dependence of a,, on n is determined by how quantum amplitudes are to 
be combined. Suppose one particle goes around the other through an angle Ag,, 
a history to which we assign an additional phase factor e’/‘4%) with f some 
as yet unknown function. Suppose this history is followed by another history in 
which our particle goes around the other by an additional angle Ag). The phase 
factor e'/(4¢1+4¢2) we assign to the combined history clearly has to satisfy the 
composition law e!f 4¢i+4¢2) — eff (Aen eif (42) Tn other words, f (Ag) has to be 
a linear function of its argument. 

We conclude that in (2 + 1)-dimensional spacetime we can associate with the 
quantum amplitude corresponding to paths in which one particle goes around 
the other anti-clockwise through an angle Ag a phase factor e!/")4%, with @ 
an arbitrary real parameter. Note that when one particle goes around the other 
clockwise through an angle Ag the quantum amplitude acquires a phase factor 
er 4e, 

When we interchange two anyons, we have to be careful to specify whether 
we do it “anticlockwise” or “clockwise,” producing factors e’° and e~'®, respec- 
tively. This indicates immediately that parity P and time reversal invariance T are 
violated. 


Chern-Simons theory 
The next important question is whether all this can be incorporated in a local 


quantum field theory. The answer was given by Wilczek and Zee, who showed that 
the notion of fractional statistics can result from the effect of coupling to a gauge 
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potential. The significance of a field theoretic formulation is that it demonstrates 
conclusively that the idea of anyon statistics is fully compatible with the cherished 
principles that we hold dear and that go into the construction of quantum field 
theory. 

Given a Lagrangian £o with a conserved current j“, construct the Lagrangian 


L=kyt+ yet a,dya, + a,j" (1) 


Here eA denotes the totally antisymmetric symbol in (2 + 1)-dimensional 
spacetime and y is an arbitrary real parameter. Under a gauge transforma- 
tion a,>a,+ O,A, the term eM*a Ody, known as the Chern-Simons term, 
changes by ehYha Aya, > ea Aya, + eX¥¥9 Ada). The action changes by 
bS=y fd 3x ghvr 0,,(Ad,a,) and thus, if we are allowed to drop boundary terms, 
as we assume to be the case here, the Chern-Simons action is gauge invariant. Note 
incidentally that in the language of differential form you learned in Chapters IV.5 
and IV.6 the Chern-Simons term can be written compactly as ada. 
Let us solve the equation of motion derived from (1): 


2ye!™a,a, = —jt# (2) 


for a particle sitting at rest (so that j; = 0). Integrating the 4. = 0 component of 
(2), we obtain 


i . 
[ @x@., _ Ona) = 2p [eu (3) 


Thus, the Chern-Simons term has the effect of endowing the charged particles in 
the theory with flux. (Here the term charged particles simply means particles that 
couple to the gauge potential a,,. In this context, when we refer to charge and flux, 
we are obviously not referring to the charge and flux associated with the ordinary 
electromagnetic field. We are simply borrowing a useful terminology.) 

By the Aharonov-Bohm effect (Chapter IV.4), when one of our particles moves 
around another, the wave function acquires a phase, thus endowing the particles 
with anyon statistics with angle @ = 1/4y (see Exercise VI.1.5). 

Strictly speaking, the term “fractional statistics” is somewhat misleading. First, 
a trivial remark: The statistics parameter 9 does not have to be a fraction. Second, 
statistics is not directly related to counting how many particles we can put into a 
state. The statistics between anyons is perhaps better thought of as a long ranged 
phase interaction between them, mediated by the gauge potential a. 

The appearance of ¢”? in (1) signals the violation of parity P and time reversal 
invariance T , something we already know. 


Hopf term 


An alternative treatment is to integrate out a in (1). As explained in Chapter III.4, 
and as in any gauge theory, the inverse of the differential operator ¢9 is not defined: 
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It has a zero mode since (e"*8,)(9, F (x)) = 0 for any smooth function F(x). Let 
us choose the Lorentz gauge 0,,a" = 0. Then, using the fundamental identity of 
field theory (see Appendix A) we obtain the nonlocal Lagrangian 


i , erg 
Hopf = 4y (1, a2 in) (4) 


known as the Hopf term. 

To determine the statistics parameter 0 consider a history in which one particle 
moves half-way around another sitting at rest. The current j is then equal to the 
sum of two terms describing the two particles. Plugging into (4) we evaluate the 


I f d 3y LyHopf 


quantum phase e!5 = e , obtain 6 = y 


Topological field theory 


There is something conceptually new about the pure Chern-Simons theory 
S=y [ d>xe"™*a, aya, (5) 


It is topological. 

Recall from Chapter I.10 that a field theory written in flat spacetime can be 
immediately promoted to a field theory in curved spacetime by replacing the 
Minkowski metric n“” by the Einstein metric g”” and including a factor ./—g 
in the spacetime integration measure. 

But in the Chern-Simons theory 7#” does not appear! Lorentz indices are 
contracted with the totally antisymmetric symbol e“”*. Furthermore, we don’t 
need the factor ./—g, as I will now show. Recall also from Chapter 1.10 that a 
vector field transforms as a, (x) = (0x"/8x")a‘ (x’) and so for three vector fields 


) = gen OX” Ax’? ax’ ax’? 


Sa pav pax tr Bee, 


ea (by (x)eq (x 
_ Ox’ Oto. cc Abt hy tg 
= det (=) eo Pa (x')b de, ) 
On the other hand, d3x’ = d3x det(@x’/3x). Observe, then, 
d’xe*™ a, (x)b, (xe, (x) = d>x'e? a’ (xbox) 


which is invariant without the benefit of ./—g. 

So, the Chern-Simons action in (5) is invariant under general coordinate 
transformation—it is already written for curved spacetime. The metric g,,,, does 
not enter anywhere. The Chern-Simons theory does not know about clocks and 
rulers! It only knows about the topology of spacetime and is rightly known as a 
topological field theory. In other words, when the integral in (5) is evaluated over 
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a closed manifold M the property of the field theory { Dae'*“ depends only on 
the topology of the manifold, and not on whatever metric we might put on the 
manifold. 


Ground state degeneracy 


Recall from Chapter I.10 the fundamental definition of energy and momentum. The 
energy-momentum tensor is defined by the variation of the action with respect to 
8uy » but hey, the action here does not depend on g,,,. The energy-momentum 
tensor and hence the Hamiltonian is identically zero! One way of saying this is 
that to define the Hamiltonian we need clocks and rulers. 

What does it mean for a quantum system to have a Hamiltonian H = 0? Well, 
when we took a Course on quantum mechanics, if the professor assigned an exam 
problem to find the spectrum of the Hamiltonian 0, we could do it easily! All states 
have energy E = 0. We are ready to hand it in. 

But the nontrivial problem is to find how many states there are. This number 
is known as the ground state degeneracy and depends only on the topology of the 
manifold M. 


Massive Dirac fermions and the Chern-Simons term 


Consider a gauge potential a, coupled to a massive Dirac fermion in (2 + 1)- 
dimensional spacetime: £ = wig+ d— m)vr. You did an exercise way back 
in Chapter II.1 discovering the rather surprising phenomenon that in (2 + 1- 
dimensional spacetime the Dirac mass term violates P and T. Thus, we would 
expect to generate the P and 7 violating the Chern-Simons term eh¥ha andy if 
we integrate out the fermion to get the term tr log(i? + ¢ — m) in the effective 
action, along the lines discussed in Chapter IV.3. 

In one-loop order we have the vacuum polarization diagram (diagrammatically 
exactly the same as in Chapter III.4 but in a spacetime with one less dimension) 
with a Feynman integral proportional to 


dp v 1 wt 
| ely piq-m sa) ©) 


As we will see, the change 4 — 3 makes all the difference in the world. I leave 
it to you to evaluate (6) in detail (Exercise VI.1.7) but let me point out the 
salient features here. Since the 3, in the Chern-Simons term corresponds to q, 
in momentum space, in order to identify the coefficient of the Chern-Simons term 
we need only differentiate (6) with respect to g, and set g > 0: 
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[Srey 
Qn pom’ pom’ p—m 


ap _trly'(p+m)y"(p + m)y"(p +m) 
Qn)? (pm) 


(7) 


I will simply focus on one piece of the integral, t the piece coming from the term in 
the trace proportional to m? 


v d? 
en we | 2 ore (8) 
(p* —m*) 


As I remarked in Exercise I.1.11, in (2 + 1)-dimensional spacetime the y”’s are 
just the three Pauli matrices and thus tr y”y*y¥ is proportional to ¢#”*: The 
antisymmetric symbol appears as we expect from P and T violation. 

By dimensional analysis, we see that the integral in (8) is up to a numerical 
constant equal to 1/m? and so m cancels. 

But be careful! The integral depends only on m? and doesn’t know about the sign 
of m. The correct answer is proportional to 1/|m|>, not 1/m7. Thus, the coefficient 
of the Chern-Simons term is equal to m3/|m|? = m/|m| = sign of m, up to a 
numerical constant. An instructive example of an important sign! This makes sense 
since under P (or T) a Dirac field with mass m is transformed into a Dirac field 
with mass —m. In a parity-invariant theory, with a doublet of Dirac fields with 
masses m and —m a Chern-Simons term should not be generated. 


Exercises 


VI1.1.1. In a nonrelativistic theory you might think that there are two separate Chern- 
Simons terms, &;;4;49a; and €;;a90;4;. Show that gauge invariance forces the 
two terms to combine into a single Chern-Simons term e“”*a,,3,a,. For the 
Chern-Simons term, gauge invariance implies Lorentz invariance. In contrast, the 
Maxwell term would in general be nonrelativistic, consisting of two terms, f and 
fi, with an arbitrary relative coefficient between them (with f,,, = 0,4, — 0,4, 
as usual). 


VI.1.2. By thinking abut mass dimensions, convince yourself that the Chern-Simons term 
dominates the Maxwell term at long distances. This is one reason that relativistic 
field theorists find anyon fluids so appealing. As long as they are interested only 
in long distance physics they can ignore the Maxwell term and play with a rela- 
tivistic theory (see Exercise VI.1.1). Note that this picks out (2 + 1)-dimensional 
spacetime as special. In (3 + 1)-dimensional spacetime the generalization of the 
Chern-Simons term e“”*? f,,, f,,, has the same mass dimension as the Maxwell 
term f?. In (4 + 1)-dimensional space the term eq od uv xo is less important 
at long distances than the Maxwell term f?. 
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VI.1.3. 


V11.4. 


VI.1.5. 


VI.1.6. 


VI.L.7. 


VI.1.8. 


There is a generalization of the Chern-Simons term to higher dimensional space- 
time different from that given in Exercise IV.1.2. We can introduce a p-form 
gauge potential (see Chapter IV.4). Write the generalized Chern-Simons term in 
(2p + 1)-dimensional spacetime and discuss the resulting theory. 


Consider £ = yae da — (1/4g*) f*. Calculate the propagator and show that the 
gauge boson is massive. Some physicists puzzled by fractional statistics have 
reasoned that since in the presence of the Maxwell term the gauge boson is 
massive and hence short ranged, it can’t possibly generate fractional statistics, 
which is manifestly an infinite ranged interaction. (No matter how far apart the 
two particles we are interchanging are, the wave function still acquires a phase.) 
The resolution is that the information is in fact propagated over an infinite range by 
ag = Opole associated with a gauge degree of freedom. This apparent paradox is 
intimately connected with the puzzlement many physicists felt when they first 
heard of the Aharonov-Bohm effect. How can a particle in a region with no 
magnetic field whatsoever and arbitrarily far from the magnetic flux know about 
the existence of the magnetic flux? 


Show that 0 = 1/4y. There is a somewhat tricky factor! of 2. So if you are off by 
a factor of 2, don’t despair. Try again. 


Find the nonabelian version of the Chern-Simons term ada. [Hint: As in Chapter 
(IV.6) it might be easier to use differential forms.} 


Using the canonical formalism of Chapter (1.8) show that the Chern-Simons 
Lagrangian leads to the Hamiltonian H = 0. 


Evaluate (6). 


1X.G. Wen and A. Zee, J. de Physique, 50: 1623, 1989. 


Chapter VI.2 
Quantum Hall Fluids 


Interplay between two pieces of physics 


Over the last decade or so, the study of topological quantum fluids (of which the 
Hal! fluid is an example) has emerged as an interesting subject. The quantum Hail 
system consists of a bunch of electrons moving in a plane in the presence of an 
external magnetic field B perpendicular to the plane. The magnetic field is assumed 
to be sufficiently strong so that the electrons all have spin up, say, so they may 
be treated as spinless fermions. As is well known, this seemingly innocuous and 
simple physical situation contains a wealth of physics, the elucidation of which 
has led to two Nobel prizes. This remarkable richness follows from the interplay 
between two basic pieces of physics. 

1. Even though the electron is pointlike, it takes up a finite amount of room. 

Classically, a charged particle in a magnetic field moves in a Larmor circle of 
radius r determined by evB = mv?/r. Classically, the radius is not fixed, with 
more energetic particles moving in larger circles, but if we quantize the angular 
momentum mur to be h = 27 (in units in which # is equal to unity) we obtain 
eBr? ~ 27. A quantum electron takes up an area of order zr? ~ 277/eB. 

2. Electrons want to stay out of each other’s way. 

Not only does each electron insist on taking up a finite amount of room, each 
has to have its own room. Thus, the quantum Hall problem may be described as 
a sort of housing crisis, or as the problem of assigning office space at an Institute 
for Theoretical Physics to visitors who do not want to share offices. 

Already at this stage, we would expect that when the number of electrons N, is 
just right to fill out space completely, namely when N,2r? ~ N,(2m7/eB) ~ A, 
the area of the system, something special happens. 


Landau levels and the integer Hall effect 


These heuristic considerations could be made precise, of course. The textbook 
problem of a single spinless electron in a magnetic field 


—[(a, — i@A,)” + (8, —ieA,y)" Wy = WME 
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was solved by Landau decades ago. The states occur in degenerate sets with energy 
E, = (n + 1) 2B n=O, 1,2,---, known as the nth Landau level. Each Landau 
level has degeneracy B A/27, where A is the area of the system, reflecting the fact 
that the Larmor circles may be placed anywhere. Note that one Landau level is 
separated from the next by a finite amount of energy (eB /m). 

Imagine putting in noninteracting electrons one by one. By the Pauli exclusion 
principle, each succeeding electron we put in has to go into a different state in the 
Landau level. Since each Landau level can hold BA/2z7 electrons it is natural (see 
Exercise VI.2.1) to define a filling factor v = N,/(BA/27). When v is equal to 
an integer, the first v Landau level is filled. If we want to put in one more electron, 
it would have to go into the (v + 1)st Landau level, costing us more energy than 
what we spent for the preceding electron. 

Thus, for v equal to an integer the Hall fluid is incompressible. Any attempt 
to compress it lessens the degeneracy of the Landau levels (the effective area 
A decreases and so the degeneracy BA/2m decreases) and forces some of the 
electrons to the next level, costing us lots of energy. 

An electric field E, imposed on the Hall fluid in the y direction produces a 
current J, =0,,E, in the x direction with o,, = v (in units of e*/h). This is 
easily understood in terms of the Lorentz force law obeyed by electrons in the 
presence of a magnetic field. The surprising experimental discovery was that the 
Hall conductance o,, when plotted against B goes through a series of plateaus, 
which you might have heard about. To understand these plateaus we would have 
to discuss the effect of impurities. I will touch upon the fascinating subject of 
impurities and disorder in Chapter VI.8. 

So, the integer quantum Hall effect is relatively easy to understand. 


Fractional Hall effect 


After the integer Hall effect, the experimental discovery of the fractional Hall 
effect, namely that the Hall fluid is also incompressible for filling factor v equal 
to simple odd-denominator fractions such as j and i, took theorists completely by 
surprise. For v = 3, only one-third of the states in the first Landau level are filled. 
It would seem that throwing in a few more electrons would not have that much 
effect on the system. Why should the v = } Hall fluid be incompressible? 

Interaction between electrons turns out to be crucial. The point is that saying 
the first Landau level is one-third filled with noninteracting spinless electrons does 
not define a unique many-body state: there is an enormous degeneracy since each 
of the electrons can go into any of the BA/27 states available subject only to 
Pauli exclusion. But as soon as we turn on a repulsive interaction between the 
electrons, a presumably unique ground state is picked out within the vast space of 
degenerate states. Wen has described the fractional Hall state as an intricate dance 
of electrons: Not only does each electron occupy a finite amount of room on the 
dance floor, but due to the mutual repulsion, it has to be careful not to bump into 
another electron. The dance has to be carefully choreographed, possible only for 
certain special values of v. 
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Impurities also play an essential role, but we will postpone the discussion of 
impurities to Chapter V1.6. 

In trying to understand the fractional Hall effect, we have an important clue. You 
will remember from Chapter V.7 that the fundamental unit of flux is given by 27, 
and thus the number of flux quanta penetrating the plane is equal to Ny = BA/27. 
Thus, the puzzle is that something special happens when the number of flux quanta 
per electron Ny/N, = vis an odd integer. 

I arranged the chapters so that what you learned in the previous chapter is 
relevant to solving the puzzle. Suppose that v—! flux quanta are somehow bound to 
each electron. When we interchange two such bound systems there is an additional 
Aharonov-Bohm phase in addition to the (—1) from the Fermi statistics of the 
electrons. For v—! odd these bound systems effectively obey Bose statistics and 
can be described by a complex scalar field y. The condensation of ¢ turns out to 
be responsible for the physics of the quantum Hall fluid. 


Effective field theory of the Hall fluid 


We would like to derive an effective field theory of the quantum Hall fluid, first 
obtained by Kivelson, Hansson, and Zhang. There are two alternative derivations, 
a long way and a short way. 

In the long way, we start with the Lagrangian describing spinless electrons in 
a magnetic field in the second quantized formalism, 


L = Wt i(d—i Ay) vt = vt -i Aye + VW) 


and massage it into the form we want. In the previous chapter, we learned that by 
introducing a Chern-Simons gauge field we can transform y into a scalar field. 
We then invoke duality, which we will learn about in the next chapter, to represent 
the phase degree of freedom of the scalar field as a gauge field. After a number of 
steps, we will discover that the effective theory of the Hall fluid turns out to be a 
Chern-Simons theory. 

Instead, I will follow the short way. We will argue by the “what else can it be” 
method or, to put it more elegantly, by invoking general principles. 

Let us start by listing what we know about the Hall system. 


1. We live in (2 + 1)-dimensional spacetime (because the electrons are re- 
stricted to a plane.) 
2. The electromagnetic current J,, is conserved: 3,,J" = 0. 


These two statements are certainly indisputable; when combined they tell us 
that the current can be written as the curl of a vector potential 


Je= laa, (2) 
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The factor of 1/(277) defines the normalization of a,,. We learned in school that 
in 3-dimensional spacetime, if the divergence of something is Zero, then that 
something is the curl of something else. That is precisely what (2) says. The only 
sophistication here is that what we learned in school works in Minkowskian space 
as well as Euclidean space—it is just a matter of a few signs here and there. 


The gauge potential comes looking for us 


Observe that when we transform a,, by a,, > a, — @,,A, the current is unchanged. 
In other words, a, is a gauge potential. 

We did not go looking for a gauge potential; the gauge potential came looking 
for us! There is no place to hide. The existence of a gauge potential follows from 
completely general considerations. 

3. We want to describe the system field theoretically by an effective local 
Lagrangian. 

4. We are only interested in the physics at long distances and large time, that is, 
at small wave number and low frequency. 

Indeed, a field theoretic description of a physical system may be regarded as a 
means of organizing various aspects of the relevant physics in a systematic way 
according to their relative importance at long distances and according to symme- 
tries. We classify terms in a field theoretic Lagrangian according to powers of 
derivatives, powers of the fields, and so forth. A general scheme for classifying 
terms is according to their mass dimensions, as explained in Chapter III.2. The 
gauge potential a,, has dimension 1, as is always the case for any gauge potential 
coupled to matter fields according to the gauge principle, and thus (2) is consis- 
tent with the fact that the current has mass dimension 2 in (2 + 1)-dimensional 
spacetime. 

5. Parity and time reversal are broken by the external magnetic field. 

This last statement is just as indisputable as statements 1 and 2. The experi- 
mentalist produces the magnetic field by driving a current through a coil with the 
current flowing either clockwise or anticlockwise. 

Given these five general statements we can deduce the form of the effective 
Lagrangian. 

Since gauge invariance forbids the dimension-2 term a,,a” in the Lagrangian, 
the simplest possible term is in fact the dimension-3 Chern-Simons term 
cH¥\q 8a). Thus, the Lagrangian is simply 


C= aeaa t+: (3) 
4n 


where k is a dimensionless parameter to be determined. 
We have introduced and will use henceforth the compact notation eadb = 
cA q a,b, = €bda for two vector fields a,, and b,. 
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The terms indicated by (-- -) in (3) include the dimension-4 Maxwell term 
(1/g°)(fg; — 5) and other terms with higher dimensions. The important observa- 
tion is that these higher dimensional terms are less important at long distances. The 
long distance physics is determined purely by the Chern-Simons term. In general 
the coefficient k may well be zero, in which case the physics is determined by the 
short distance terms represented by the (- - -) in (3). Put differently, a Hall fluid 
may be defined as a 2-dimensional electron system for which the coefficient of the 
Chern-Simons term does not vanish, and consequently is such that its long distance 
physics is largely independent of the microscopic details that define the system. 
Indeed, we can classify 2-dimensional electron systems according to whether k is 
zero or not. 

Coupling the system to an “external” or “additional” electromagnetic gauge 
potential A,, and using (2) we obtain (after integrating by parts and dropping a 
surface term) 


k 1 k 1 
L= —eYha Bay _ 5 Ay Bua = ye anda _ 5 au avAy 
(4) 


Note that the gauge potential of the magnetic field responsible for the Hall effect 
should not be included in A,,; it is implicitly contained already in the coefficient k. 

The notion of quasiparticles or “elementary” excitations is basic to condensed 
matter physics. The effects of a many-body interaction may be such that the quasi- 
particles in the system are no longer electrons. Here we define the quasiparticles 
as the entities that couple to the gauge potential and thus write 


k . 1 
L= a 4 + a,j? — 5 aay nae (5) 


Defining ju = Jy — A/2m) ey 0” A* and integrating out the gauge field we obtain 
(see VI.1.4) 


n~ [cha ~ 
L=— jy (<*) Ja, (6) 


Fractional charge and statistics 


We can now simply read off the physics from (6). The Lagrangian contains 
three types of terms: AA, Aj, and jj. The AA term has the schematic form 
A(ed€d€9/8*)A. Using €d¢3 ~ 8? and canceling between numerator and de- 
nominator, we obtain 


oa cana (7) 
4nk 
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Varying with respect to A we determine the electromagnetic current 


1 
je = ee Bes (8) 
We learn from the 4 =0 component of this equation that an excess density 
én of electrons is related to a local fluctuation of the magnetic field by én = 
(1/27rk)6B; thus we can identify the filling factor v as 1/k, and from the p =i 
components that an electric field produces a current in the orthogonal direction with 
Oxy = (I/k) =v. 
The Aj term has the schematic form A(€@¢4/87)j. Canceling the differential 
operators, we find 


1, , 
L= poe x (9) 
Thus, the quasiparticle carries electric charge I/k. 
Finally, the quasiparticles interact with each other via 


Enyad 
ie i (10) 


L= 
We simply remove the twiddle sign in (6). Recalling Chapter (VI.1) we see that 
quasiparticles obey fractional statistics with 


@ 


if 
Tk (11) 

By now, you may well be wondering that while all this is fine and good, what 
would actually tell us that v~! has to be an odd integer? 

We now argue that the electron or hole should appear somewhere in the excita- 
tion spectrum. After all, the theory is supposed to describe a system of electrons 
and thus far our rather general Lagrangian does not contain any reference to the 
electron! 

Let us look for the hole (or electron). We note from (9) that a bound object made 
up of k quasiparticles would have charge equal to 1. This is perhaps the hole! For 
this to work, we see that k has to be an integer. So far so good, but k doesn’t have 
to be odd yet. 

What is the statistics of this bound object? Let us move one of these bound 
objects half-way around another such bound object, thus effectively interchanging 
them. When one quasiparticle moves around another we pick up a phase given by 
@/m = 1/k according to (11). But here we have k quasiparticles going around k 
quasiparticles and so we pick up a phase 


GH leu, (12) 
an ok 
For the hole to be a fermion we must require @/s to be an odd integer. This fixes 


Kk to be an odd integer. 
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Since v = 1/k, we have here the classic Laughlin odd-denominator Hall fluids 
with filling factor v = 5, 3, 3, --- . The famous result that the quasiparticles carry 
fractional charge and statistics just pops out [see (9) and (11)]. 

This is truly dramatic: a bunch of electrons moving around in a plane with 
a magnetic field corresponding to v = 3, and lo and behold, each electron has 
fragmented into three pieces, each piece with charge 4 and fractional statistics 1! 


A new kind of order 


The goal of condensed matter physics is to understand the various states of mat- 
ter. States of matter are characterized by the presence (or absence) of order: a 
ferromagnet becomes ordered below the transition temperature. In the Landau- 
Ginzburg theory, as we saw in Chapter V.3, order is associated with spontaneous 
symmetry breaking, described naturally with group theory. Girvin and MacDonald 
first noted that the order in Hall fluids does not really fit into the Landau-Ginzburg 
scheme: We have not broken any obvious symmetry. The topological property of 
the Hall fluids provides a clue to what is going on. As explained in the preceding 
chapter, the ground state degeneracy of a Hall fluid depends on the topology of 
the manifold it lives on, a dependence group theory is incapable of accounting 
for, Wen has forcefully emphasized that the study of topological order, or more 
generally quantum order, may open up a vast new vista on the possible states of 
matter. 


Comments and generalization 


Let me conclude with several comments that might spur you to explore the wealth 
of literature on the Hall fluid. 

1. The appearance of integers implies that our result is robust. A slick argument 
can be made based on the remark in the previous chapter that the Chern-Simons 
term does not know about clocks and rulers and hence can’t possibly depend on 
microphysics such as the scattering of electrons off impurities which cannot be 
defined without clocks and rulers. In contrast, the physics that is not part of the 
topological field theory and described by (- - -) in (3) would certainly depend on 
detailed microphysics. 

2. If we had followed the long way to derive the effective field theory of the Hall 
fluid, we would have seen that the quasiparticle is actually a vortex constructed (as 
in Chapter V.7) out of the scalar field representing the electron. Given that the Hall 
fluid is incompressible, just about the only excitation you can think of is a vortex 
with electrons coherently whirling around. 

3. In the previous chapter we remarked that the Chern-Simons term is gauge 
invariant only upon dropping a boundary term. But real Hall fluids in the labora- 
tory live in samples with boundaries. So how can (3) be correct? Remarkably, this 
apparent “defect” of the theory actually represents one of its virtues! Suppose the 
theory (3) is defined on a bounded 2-dimensional manifold, a disk for example. 
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Then as first argued by Wen there must be physical degrees of freedom living on 
the boundary and represented by an action whose change under a gauge trans- 
formation cancels the change of f d°x(k/4m)ae da. Physically, it is clear that an 
incompressible fluid would have edge excitations! corresponding to waves on its 
boundary. 

4. What if we refuse to introduce gauge potentials? Since the current J, has 
dimension 2, the simplest term constructed out of the currents, J i J, is already of 
dimension 4; indeed, this is just the Maxwell term. There is no way of constructing 
a dimension 3 local interaction out of the currents directly. To lower the dimension 
we are forced to introduce the inverse of the derivative and write schematically 
J(1/ea)J, which is of course just the non-local Hopf term. Thus, the question 
“why gauge field?” that people often ask can be answered in part by saying that the 
introduction of gauge fields allows us to avoid dealing with nonlocal interactions. 

5. Experimentalists have constructed double-layered quantum Hall systems 
with an infinitesimally small tunneling amplitude for electrons to go from one layer 
to the other. Assuming that the current J a (7 = 1, 2) in each layer is separately 
conserved, we introduce two gauge potentials by writing J * = sre O.a 7, as in 
(2). We can repeat our general argument and arrive at the effective Lagrangian 


K 
b=) rayeday ++ (13) 
I,J 


The integer k has been promoted to a matrix K. As an exercise, you can derive 
the Hall conductance, the fractional charge, and the statistics of the quasiparticles. 
You would not be surprised that everywhere 1/k appears we now have the matrix 
inverse K~! instead. An interesting question is what happens when K has a 


zero eigenvalue. For example, we could have K = ( 7 ) . Then the low energy 


dynamics of the gauge potential a_ = a, — ap is not governed by the Chern-Simons 
term, but by the Maxwell term in the (. - -) in (13). We have a linearly dispersing 
mode and thus a superfluid! This striking prediction” was verified experimentally. 

6. Finally, an amusing remark: In this formalism electron tunneling corresponds 
to the nonconservation of the current J = Jy — Js = (1/2mr)e""*d,a_,. The 
difference N, — N, of the number of electrons in the two layers is not conserved. 
But how can 0,,J * £0 even though J" is the curl of a_, (as I have indicated 
explicitly)? Recalling Chapter IV.4, you the astute reader say, aha, magnetic 
monopoles! Tunneling in a double-layered Hall system in Euclidean spacetime can 
be described as a gas of monopoles and antimonopoles.’ (Think, why monopoles 
and antimonopoles?) Note of course that these are not monopoles in the usual 
electromagnetic gauge potential but in the gauge potential a_,. 


' The existence of edge currents in the integer Hall fluid was first pointed out by Halperin. 
2X. G. Wen and A. Zee, Phys. Rev. Lett. 69: 1811, 1992. 
3X. G. Wen and A. Zee, Phys. Rev. B47: 2265, 1993. 
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What we have given in this section is certainly a very slick derivation of the 
effective long distance theory of the Hall fluid. Some would say too slick. Let us go 
back to our five general statements or principles. Of these five, four are absolutely 
indisputable. In fact, the most questionable is the statement that looks the most 
innocuous to the casual reader, namely statement 3. In general, the effective 
Lagrangian for a condensed matter system would be nonlocal. We are implicitly 
assuming that the system does not contain a massless field, the exchange of which 
would lead to a nonlocal interaction. Also implicit in (3) is the assumption that 
the Lagrangian can be expressed completely in terms of the gauge potential a. A 
priori, we certainly do not know that there might not be other relevant degrees of 
freedom. The point is that as long as these degrees of freedom are not gapless they 
can be safely integrated out. 


Exercises 


VI.2.1. To define filling factor precisely, we have to discuss the quantum Hall system on 
a sphere rather than on a plane. Put a magnetic monopole of strength G (which 
according to Dirac can be only a half-integer or an integer) at the center of a unit 
sphere. The flux through the sphere is equal to Ny = 2G. Show that the single 
electron energy is given by E, = (jhw,) [/@ + 1) — G?] /G with the Landau 
levels corresponding to/ =G,G+1,G+4+2, ---. and that the degeneracy of 
the /th level is 2/ + 1. With Z Landau levels filled with noninteracting electrons 
(v = L) show that N, = v~'N, — 8, where the topological quantity § is known 
as the shift. 


VI.2.2. Fora challenge, derive the effective field theory for Hall fluids with filling factor 
v =m/k with k an odd integer, such as v = 2. [Hint: You have to introduce m 
gauge potentials a,, and generalize (2) to J¢ = (1/2r)e4"44, "7", az,. The 
effective theory turns out to be 


lu m - 
L=— a,K,,;€8a,;+ a,,jiut--- 
in rATy J X Iud 


with the integer k replaced by a matrix K. Compare with (13).] 


VI.2.3. For the Lagrangian in (13), derive the analogs of (8), (9), and (11). 


4 A technical remark: Vortices (ie., quasiparticles) pinned to impurities in the Hall fluid 
can generate an interaction nonlocal in time. 


Chapter VI.3 


Duality 


A far reaching concept 


Duality is a profound and far reaching concept! in theoretical physics, with origins 
in electromagnetism and statistical mechanics. The emergence of duality in recent 
years in several areas of modern physics, ranging from the quantum Hall fluids 
to string theory, represents a major development in our understanding of quantum 
field theory. Here I touch upon one particular example just to give you a flavor of 
this vast subject. 

My plan is to treat a relativistic theory first, and after you get the hang of 
the subject, I will go on to discuss the nonrelativistic theory. It makes sense 
that some of the interesting physics of the nonrelativistic theory is absent in the 
relativistic formulation: A larger symmetry is more constraining. By the same 
token, the relativistic theory is actually much easier to understand if only because 
of notational simplicity. 


Vortices 


Couple a scalar field in (2 + 1)-dimensions to an external electromagnetic gauge 
potential, with the electric charge g indicated explicitly for later convenience: 


= 31(8, —igA,)el — Vig"g) (1) 
We have already studied this theory many times, most recently in Chapter V.7 in 
connection with vortices. As usual, write g = |g|e’”. Minimizing the potential V 


at |p| = v gives the ground state field configuration. Setting g = ve'® in (1) we 
obtain 


L = 407(8,0 —GA,)° (2) 
' Ror a first introduction to duality, I highly recommend J. M. Figueroa-O’Farrill, Elec- 
tromagnetic Duality for Children, http://www.maths.ed.ac.uk/~jmf/EDC. html. 
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which upon absorbing @ into A by a gauge transformation we recognize as the 
Meissner Lagrangian. For later convenience we also introduce the alternative form 


1 
L= 58 + &"(3,0 —qA,) (3) 


We recover (2) upon eliminating the auxiliary field €“ (see Appendix A). 

In Chapter V.7 we learned that the excitation spectrum includes vortices and 
anti-vortices, located where |g| vanishes. If around the zero of |y/, @ changes by 
27, we have a vortex. Around an antivortex, A@ = —27. Recall that around a 
vortex Sitting at rest, the electromagnetic gauge potential has to go as 


qA; > 9,6 (4) 


at spatial infinity in order for the energy of the vortex to be finite, as we can see 
from (2). The magnetic flux 


~ > AC 2 
[ @seyaaj= of dz A= ome (5) 


is quantized in units of 27 /q. 

Let us pause to think physically for a minute. On a distance scale large compared 
to the size of the vortex, vortices and antivortices appear as points. As discussed 
in Chapter V.7, the interaction energy of a vortex and an antivortex separated 
by a distance R is given by simply plugging into (2). Ignoring the probe field 
A,,, Which we can take to be as weak as possible, we obtain ~ f dr r(vey? ~ 
log(R/a) where a is some short distance cutoff. But recall that the Coulomb 
interaction in 2-dimensional space is logarithmic since by dimensional analysis 
f d2kei* * 1k?) ~ log(|x|/a) (with a~! some ultraviolet cutoff). Thus, a gas of 
vortices and antivortices appears as a gas of point “charges” with a Coulomb 
interaction between them. 


Vortex as charge in a dual theory 


Duality is often made out by some theorists to be a branch of higher mathematics 
but in fact it derives from an entirely physical idea. In view of the last paragraph, 
can we not rewrite the theory so that vortices appear as point “charges” of some 
as yet unknown gauge field? In other words, we want a dual theory in which the 
fundamental field creates and annihilates vortices rather than g quanta. We will 
explain the word “dual” in due time. 

Remarkably, the rewriting can be accomplished in just a few simple steps. 
Proceeding physically and heuristically, we picture the phase field @ as smoothly 
fluctuating, except that here and there it winds around 277. Write 3,,0 = ,,9smooth + 
9,, Vortex. Plugging into (3) we write 


8 + gM (a 1 smooth + 0,0 vortex —qA,) (6) 
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Integrate over smooth and obtain the constraint 3,64 = 0, which can be solved by 
writing 


EM — gl¥AG gy (7) 


a trick we used earlier in Chapter VI.2. As in that chapter, a gauge potential comes 
looking for us, since the change a, —> a, + 8,A does not change §”. Plugging 
into (6), we find 


1 
A= ere + EG. (8 Pvortex ~ qAy) (8) 


where fi, = 9,4, — day, 

Our treatment is heuristic because we ignore the fact that |g| vanishes at the 
vortices. Physically, we think of the vortices as almost pointlike so that |g| = v 
“essentially” everywhere. As mentioned in Chapter V.6, by appropriate choice of 
parameters we can make solitons, vortices, and so on as small as we like. In other 
words, we neglect the coupling between 6, o,4¢, and ||. A rigorous treatment would 
require a proper short distance cutoff by putting the system on a lattice. ? But as 
long as we capture the essential physics, as we assuredly will, we will ignore such 
niceties. 

Note for later use that the electromagnetic current J“, defined as the coefficient 
of — A,, in (8), is determined in terms of the gauge potential a, to be 


J! = qe! 4a, (9) 


Let us integrate the term e“”*3,a,.3,,Oyorex in (8) by parts to obtain a,e*“"4,, 
8, Ovortex: According to Newton and Leibniz, 4,, commutes with ,, and so appar- 
ently we get zero. But 4,, and 4, commute only when acting on a globally defined 
function, and, heavens to Betsy, @,ore, is not globally defined since it changes 
by 22 when we go around a vortex. In particular, consider a vortex at rest and 
look at the quantity ag couples to in (8) namely e'/ 4,4 ‘vortex = =Vx (VO yortex) 
in the notation of elementary physics. Integrating this over a region containing 
the vortex gives { d2xV x (VO vortex) = = $ dx - VO vortex = = 27. Thus, we recognize 
(1/2m)e" 9 8 ;Ovortex a8 the density of vortices, the time component of some vortex 
current j*_.,. By Lorentz invariance, jo, = (1/2 )e*#"8,,8, A vortex: 

Thus, we can now write (8) as 


1 . 
L= —Ta fiw + (27) Ay forex — Au(qen”*8ya)) (10) 
Lo and behold, we have accomplished what we set out to do. We have rewritten 


the theory so that the vortex appears as an “electric charge” for the gauge potential 


2 For example, M. P. A. Fisher, “Mott Insulators, Spin Liquids, and Quantum Disordered 
Superconductivity,” cond-mat/9806164, Appendix A. 
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a,,- Sometimes this is called a dual theory, but strictly speaking, it is more accurate 
to refer to it as the dual representation of the original theory (1). 

Let us introduce a complex scalar field ®, which we will refer to as the vortex 
field, to create and annihilate the vortices and antivortices. In other words, we 
“elaborate” the description in (10) to 


1 


I ; 
=~ Tahun + 51Qu—iAm)a,yoP — W(%) — A,(ge""*d,a,) (11) 


The potential W(®) contains terms such as A(@?@)2 describing the short distance 
interaction of two vortices (or a vortex and an antivortex.) In principle, if we master 
all the short distance physics contained in the original theory (1) then these terms 
are all determined by the original theory. 


Vortex of a vortex 


Now we come to the most fascinating aspect of the duality representation and 
the reason why the word “dual” is used in the first place. The vortex field ® is a 
complex scalar field, just like the field g we started with. Thus we can perfectly 
well form a vortex out of @, namely a place where ® vanishes around which the 
phase of ® goes through 27. Amusingly, we are forming a vortex of a vortex, so 
to speak. 

So, what is a vortex of a vortex? 

The duality theorem states that the vortex of a vortex is nothing but the original 
charge, described by the field g we started out with! Hence the word duality. 

The proof is remarkably simple. The vortex in the theory (11) carries “magnetic 
flux.” Referring to (11) we see that 27a; — 9,6 at spatial infinity. By exactly the 
same Manipulation as in (5), we have 


20 [ @xeia; =2n § dx -a =2n (12) 


Note that I put quotation marks around the term “magnetic flux” since as is evident 
Tam talking about the flux associated with the gauge potential a,, and not the flux 
associated with the electromagnetic potential A,,. But remember that from (9) the 
electromagnetic current J = ge“”*d,a, and in particular J° = ge‘! a,a j- Hence, 
the electric charge (note no quotation marks) of this vortex of a vortex is equal 
to f d’xJ° =q, precisely the charge of the original complex scalar field g. This 
proves the assertion. 

Here we have studied vortices, but the same sort of duality also applies to 
monopoles. As I remarked in Chapter IV.4, duality allows us a glimpse into field 
theories in the strongly coupled regime.We learned in Chapter V.7 that certain 
spontaneously broken nonabelian gauge theories in (3 + 1)-dimensional spacetime 
contains magnetic monopoles. We can write a dual theory in terms of the monopole 
field out of which we can construct monopoles. The monopole of a monopole 
turns out be none other than the charged fields of the original gauge theory. This 
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duality was first conjectured many years ago by Olive and Montonen and later 
shown to be realized in certain supersymmetric gauge theories by Seiberg and 
Witten. The understanding of this duality was a “hot” topic a few years ago as it 
led to deep new insight about how certain string theories are dual to each other. 
In contrast, according to one of my distinguished condensed matter colleagues, 
the important notion of duality is still underappreciated in the condensed matter 
physics community. 


Meissner begets Maxwell and so on 


We will close by elaborating slightly on duality in (2 + 1)-dimensional spacetime 
and how it might be relevant to the physics of 2-dimensional materials. Consider a 
Lagrangian £ (a) quadratic in a vector field a,,. Couple an external electromagnetic 
gauge potential A,, to the conserved current EHYA9 a, : 


£=L(a)+ A, (e""*3,a)) (13) 
Let us ask: For various choices of £ (a), if we integrate out a whatis the effective 
Lagrangian £(A) describing the dynamics of A? 
If you have gotten this far in the book, you can easily do the integration. The 
central identity of quantum field theory again! Given 
L(a)~aKa (14) 
we have 
I I 
L(A) ~ (€0A)— (ed A) ~ A(ed—ED)A (15) 
K K 
We have three choices for &(a) to which I attach various illustrious names: 


L(a)~ a? Meissner 


L(a) ~ aeda Chern-Simons (16) 


L(a) ~ f* ~ad’a | Maxwell 


Since we are after conceptual understanding, I won’t bother to keep track of 
indices and irrelevant overall constants. (You can fill them in as an exercise.) For 
example, given &(a) = f,,f" with f,, = 9,4, — d,a,, we can write &(a) ~ 
ad’a and so K = 3°. Thus, the effective dynamics of the external electromagnetic 
gauge potential is given by (15) as &(A) ~ A[ed(1/ 87)ea]A ~ A’, the Meissner 
Lagrangian! In this “quick and dirty” way of making a living, we simply set 


3 For example, D. I. Olive and P. C. West, eds., Duality and Supersymmetric Theories. 
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ee ~ 1 and cancel factors of 0 in the numerator against those in the denominator. 
Proceeding in this way, we construct the following table: 


Dynamics of a K | Effective Lagrangian Dynamics of the 
L(A) ~ Afed(1/K)ea]A external probe A 


Chern-Simons aeda A(ed 68) ~ AcdA Chern-Simons 
A&tdA 


Maxwell f? ~ a92a A(edzhea)A ~ AA Meissner A? 


(17) 


Meissner begets Maxwell, Chern-Simons begets Chern-Simons, and Maxwell 
begets Meissner. I find this beautiful and fundamental result, which represents a 
form of duality, very striking. Chern-Simons is self-dual: It begets itself. 


Going nonrelativistic 


It is instructive to compare the nonrelativistic treatment of duality.4 Go back to the 
superfluid Lagrangian of Chapter V.1: 


. I ~ 
L = ip! ayy — = aip'dip — 8°(o'g — py (18) 
As before, substitute g = /pel® to obtain 
L = —pao — © (8,6)? — 9p — pyr +-- (19) 
2m 
which we rewrite as 
m _ 
L=—§,3"0 + —& — 8p — py te (20) 
2p 
In (19) we have dropped a term ~ (8, '/2)?. In (20) we have defined & =p. 
Integrating out ; in (20) we recover (19). 

All proceeds as before. Writing 6 = 6.,nooth + Pvortex and integrating out O, nooth> 
we obtain the constraint 0“, = 0, solved by writing &” = «“”*4,a,. The hat on 
a, is for later convenience, Note that the density 

by =p =6,0,4,=f (21) 


4 The treatment given here follows essentially that given by M. P. A. Fisher and D. H. 
Lee. 
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is the “magnetic” field strength while 
§) = €j @o@j — 9;40) = Jo; (22) 


is the “electric” field strength. 
Putting all of this into (20) we have 


m a - mo 
L= apie _ ef _ p)° _ 27 Gy, Hortex tee: (23) 


To “subtract out” the background “magnetic” field 6, an obviously sensible move 
is to write 


a, =4, +4, (24) 


where we define the background gauge potential by ay = 0, dga; = 0 (no back- 
ground “electric” field) and 


€{j9;4; = p (25) 


The Lagrangian (23) then takes on the cleaner form 


m . 
L= (48 —_ f°) _ 27 Ay ivortex — 27 Gi; j' sex tee (26) 


We have expanded p ~ 9 in the first term. As in (10) the first two terms form the 
Maxwell Lagrangian, and the ratio of their coefficients determines the speed of 
propagation 


1/2 
= (£2) (27) 


m 


In suitable units in which c = 1, we have 
__ mm WY _ Ong GH 9 28 
b= 40 Suv tf — 2M ay Jonex ~ Gi Forex tee: (28) 


Compare this with (10). ~ 
The one thing we missed with our relativistic treatment is the last term in (28), 
for the simple reason ihat we didn’t put in a background. Recall that the term 
like A; J; in ordinary electromagnetism means that a moving particle associated 
with the current J; sees a magnetic field Vv x A. Thus, a moving vortex will see a 
“magnetic field” 


€,j9;(4 + 4); = P + €;;9;4; (29) 


equal to the sum of p the density of the original bosons and a fluctuating field. 
In the Coulomb gauge 4.a; = 0 we have (fo;)” = (89a;)? + (0; -dy)*, where the 
cross term (d9a;)(0;a) effectively vanishes upon integration by parts. Integrating 
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out the Coulomb field ag, we obtain 


L= - ? any | | d’xd’y Ee log a) io) 
wn 


a 


+ 75 oa)” — gf? +2n(a; +4) jY"™ (30) 


The vortices repel each other by a logarithmic interaction /{ ak (etki /k°) ~ 
log({x|/a) as we have known all along. 


A self-dual theory 


Interestingly, the spatial part f* of the Maxwell Lagrangian comes from the short 
ranged repulsion between the original bosons. 

If we had taken the bosons to interact by an arbitrary potential V(x) we would 
have, instead of the last term in (20), 


[[ @x@y toe - AVE» [oo -A] G1) 


It is easy to see that ail the steps go through essentially as before, but now the 
second term in (26) becomes 


[[ @x@sreove —»Fo 2) 
Thus, the gauge field propagates according to the dispersion relation 


wo = (2p/m)V (k)k (33) 


where V(k) is the Fourier transformation of V(x). In the special case V(x) = 
g°5) (x) we recover the linear dispersion given in (27). Indeed, we have a linear 
dispersion w x lk as long as V (x) is sufficiently short ranged for V(k = 0) to be 
finite. 

An interesting case is when V (x) is logarithmic. Then V (k) goes as 1/k? and so 
«@ ~ constant: The gauge field a; becomes massive and drops out. The low energy 
effective theory consists of a bunch of vortices with a logarithmic interaction 
between them. Thus, a theory of bosons with a logarithmic repulsion between 
them is self dual in the low energy limit. 


The dance of vortices and antivortices 
Having gone through this nonrelativistic discussion of duality, let us reward our- 


selves by deriving the motion of vortices in a fluid. Let the bulk of the fluid be at 
rest. According to (28) the vortex behaves like a charged particle in a background 


VL3. Duality 317 


(a) (b) 


Figure VI.3.1 


magnetic field b proportional to the mean density of the fluid 6. Thus, the force 
acting on a vortex is the usual Lorentz force ¥ x B, and the equation of motion of 
the vortex in the presence of a force F is then just 

Pej jx; = F; (34) 
This is the well-known result that a vortex, when pushed, moves in a direction 
perpendicular to the force. 

Consider two vortices. According to (28) they repel each other by a logarithmic 
interaction. They move perpendicular to the force. Thus, they end up circling each 
other. In contrast, consider a vortex and an antivortex, which attract each other. As 
a result of this attraction, they both move in the same direction, perpendicular to 
the straight line joining them (see Fig. VI.3.1). The vortex and antivortex move 
along in step, maintaining the distance between them. This in fact accounts for 
the famous motion of a smoke ring. If we cut a smoke ring through its center and 
perpendicular to the plane it lies in, we have just a vortex with an antivortex for 
each section. Thus, the entire smoke ring moves along in a direction perpendicular 
to the plane it lies in. 

All of this can be understood by elementary physics, as it should be. The key 
observation is simply that vortices and antivortices produce circular flows in the 
fluid around them, say clockwise for vortices and anticlockwise for antivortices. 
Another basic observation is that if there is a local flow in the fluid, then any 
object, be it a vortex or an antivortex, caught in it would just flow along in the 
same direction as the local flow. This is a consequence of Galilean invariance. By 
drawing a simple picture you can see that this produces the same pattern of motion 
as discussed above. 


Chapter VI.4 
The o Models as Effective Field Theories 


The Lagrangian as a mnemonic 


Our beloved quantum field theory has had two near death experiences. The first 
started around the mid-1930s when physical quantities came out infinite. But it 
roared back to life in the late 1940s and early 1950s, thanks to the work of the 
generation that included Feynman, Schwinger, Dyson, and others. The second 
occurred toward the late 1950s. As we have already discussed, quantum field 
theory seemed totally incapable of addressing the strong interaction: The coupling 
was far too strong for perturbation theory to be of any use. Many physicists— 
known collectively as the S-matrix school—felt that field theory was irrelevant for 
studying the strong interaction and advocated a program of trying to derive results 
from general principles without using field theory. For example, in deriving the 
Goldberger-Treiman relation, we could have foregone any mention of field theory 
and Feynman diagrams. 

Eventually, in a reaction against this trend, people realized that if some results 
could be obtained from general considerations such as notions of spontaneous 
symmetry breaking and so forth, any Lagrangian incorporating these general 
properties had to produce the same results. At the very least, the Lagrangian 
provides a mnemonic for any physical result derived without using quantum field 
theory. Thus was born the notion of long distance or low energy effective field 
theory, which would prove enormously useful in both particle and condensed 
matter physics (as we have already seen.) 


The strong interaction at low energies 


One of the earliest examples is the o model of Gell-Mann and Lévy, which 
describes the interaction of nucleons and pions. We now know that the strong 
interaction has to be described in terms of quarks and gluons. Nevertheless, at 
long distances, the degrees of freedom are the two nucleons and the three pions. 


The proton and the neutron transform as a spinor wy = (7 ) under the SU(2) 


of isospin. Consider the kinetic energy term Wiydy = Wriydw, + Wriy de. 
We note that this term has the larger symmetry SU (2); x SU(2)z, with the left 
handed field y, and the right handed field yy, transforming as a doublet under 
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SU (2), and SU (2) p, respectively. [The SU (2) of isospin is the diagonal subgroup 
of SU (2); x SU(2)p.] We can write y, ~ (4, 0) and vp ~ (0, 4). 

Now we see a problem immediately: The mass term my = (Wiwep + hc.) 
is not allowed since W, Wp ~ (4, 4), a 4-dimensional representation of SU (2), x 
SU (2), a group locally isomorphic to SO (4). 

At this point, lesser physicists would have said, what is the problem, we knew 
all along that the strong interaction is invariant only under the SU (2) of isospin, 
which we will write as su (2). Under SU (2), the bilinears constructed out ot vy 
and yp transform as 5 x + =0 + 1, the singlet being ww and the triplet piy>r? wv. 
With only SU(2), symmetry, we can certainly include the mass term yy. 

To say it somewhat differently, to fully couple to the four bilinears we can 
construct out of y, and yp, namely ww and Wiy>r’y, we need four meson 
fields transforming as the vector representation under 5O(4). But only the three 
pion fields are known. It seems clear that we only have SU (2); symmetry. 

Nevertheless, Gell-Mann and Lévy boldly insisted on the larger symmetry 
SU (2); x SU(2)z ~ SO(4) and simply postulated an additional meson field, 
which they called a, so that (o, m) form the 4-dimensionai representation. I leave 
it to you to verify that Wri(o Hit -T)pthe= =w(o +it - tys)W is invariant. 
Hence, we can write down the invariant Lagrangian 


L=pliyat+ go +it-tys)lW¥ +L, z) (1) 


where the part not involving the nucleons reads 
- 1 4 
Lio, tt) = (80) + (at) + +Ee + 3°) — “(0° +R (2) 


This is known as the linear 0 model. 

The o model would have struck most physicists as rather strange at the time it 
was introduced: The nucleon does not have a mass and there is an extra meson field. 
Aha, but you would recognize (2) as precisely the Lagrangian (IV.1.2) (for N = 4) 
that we studied, which exhibits spontaneous symmetry breaking. The four scalar 
fields (~, >), G3, 94) in (IV.1.2) correspond to (a, 7). With no loss of generality, 
we can choose the vacuum expectation value of ¢ to point in the 4th direction, 
namely the vacuum in which (0| 0 |(0) = 42/4 = v and (0| # |0) = 0. Expanding 
o =v +o’ we see immediately that the nucleon has a mass M = gv. You should 
not be surprised that the pion comes out massless. The meson associated with the 
field 0’, which we will call the 0 meson, has no reason to be massless and indeed 
is not. 

Can the all-important parameter v be related to a measurable quantity? Indeed. 
From Chapter 1.9 you will recall that the axial current is given by Noether’s 
theorem as Vs = WV yp¥5(t7/2)¥ + 2°3,0 —09,7°. After o acquires a vacuum 
expectation value, Js contains a term —v9,,7°. This term implies that the matrix 
element (0| Js |) = ivk,,, where k denotes the momentum of the pion, and 
thus v is proportional to the f defined in Chapter IV.2. Indeed, we recognize the 
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mass relation M = gv as precisely the Goldberger-Treiman relation (IV.2.7) with 
F (0) = 1 (see Exercise VI.4.4). 


The nonlinear oc model 


It was eventually realized that the main purpose in life of the potential in (oc, 7) 
is to force the vacuum expectation values of the fields to be what they are, so the 
potential can be replaced by a constraint o? + 7? = v*. A more physical way of 
thinking about this point is by realizing that the o meson, if it exists at all, must be 
very broad since it can decay via the strong interaction into two pions. We might 
as well force it out of the low energy spectrum by making its mass large. By now, 
you have learned from Chapters IV.1 and V.1 that the mass of the o meson, namely 
/2u, can be taken to infinity while keeping v fixed by letting 2 and A tend to 
infinity, keeping their ratio fixed. 

We will now focus on £(c, 7). Instead of thinking abut £(o, 7) = 1[(aa)? + 
(az)7] with the constraint 0? + 7° = v”, we can simply solve the constraint and 
plug the solution o = Jv? — x? into the Lagrangian, thus obtaining what is known 
as the nonlinear o model: 

2 aay? 
ee | = =(0#) + sat aH) +--- (3) 


b=? [oa + 
2 
Note that £ can be written in the form £ = (82°)G*"(z)(82°"); some people like 
to think of G?? as a “metric” in field space. [Incidentally, recall that way back in 
Chapter 1.3 we restricted ourselves to the simplest possible kinetic energy term 
1a)”, rejecting possibilities such as U(~)(8g)*. But recall also that in Chapter 
IV.3 we noted that such a term would arise by quantum fluctuations. ] 

In accordance with the philosophy that introduced this chapter, any Lagrangian 
that captures the correct symmetry properties should describe the same low energy 
physics. This means that anybody, including you, can introduce his or her own 
parametrization of the fields. 

The nonlinear o model is actually an example of a broad class of field theories 
whose Lagrangian has a simple form but with the fields appearing in it subject to 
some nontrivial constraint. An example is the theory defined by 


2 
L(U)= - tr(a,UT - a#U) (4) 
with U(x) a matrix-valued field and an element of SU (2). Indeed, if we write 
U =e/D"7 we see that £(U) = 4(a7)* + (1/2 f?)(z - a2)? + - + - , identical to 
(3) up to the terms indicated. 

There is considerably more we can say about the nonlinear o models and their 
applications in particle and condensed matter physics, but a thorough discussion 
would take us far beyond the scope of this book. Instead, I will develop some of 
their properties in the exercises and in the next chapter will sketch how they can 
arise in one class of condensed matter systems. 


VI-4.1. 


VI.4.2. 


VI4.3. 


VI44, 
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Exercises 


Show that the vacuum expectation value of (0, 7) can indeed point in any 
direction without changing the physics. At first sight, this statement seems strange 
since, by virtue of its y; coupling to the nucleon, the pion is a pseudoscalar field 
and cannot have a vacuum expectation without breaking parity. But ( , 77) are just 
Greek letters. Show that by a suitable transformation of the nucleon field parity 
is conserved, as it should be in the strong interaction. 


Calculate the pion-pion scattering amplitude up to quadratic order in the external 
momenta, using the nonlinear a model (3). [Hint: For help, see §. Weinberg, Phys. 
Rev. Lett. 17: 616, 1966.] 


Calculate the pion-pion scattering amplitude up to quadratic order in the external 
momenta, using the linear o model (2). Don’t forget the Feynman diagram 


involving o meson exchange. You should get the same result as in Exercise V1.4.2. 


Show that the mass relation M = gv amounts to the Goldberger-Treiman relation. 


Chapter VI.5 


Ferromagnets and Antiferromagnets 


Magnetic moments 


In Chapters IV.1 and V.3 I discussed how the concept of the Nambu-Goldstone 
boson originated as the spin wave in a ferromagnetic or an antiferromagnetic 
material. A cartoon description of such materials consists of a regular lattice on 
each site of which sits a local magnetic moment, which we denote by a unit vector 
i, with 7 labeling the site. In a ferromagnetic material the magnetic moments on 
neighboring sites want to point in the same direction, while in an antiferromagnetic 
material the magnetic moments on neighboring sites want to point in opposite 
directions. In other words, the energy is H =J }°_, i> fi; -n j Where i and 7 label 
neighboring sites. For antiferromagnets J > 0, and for ferromagnets J < 0. 

In a more microscopic treatment, we would start with a Hamiltonian (such as 
the Hubbard Hamiltonian) describing the hopping of electrons and the interaction 
between them. Within some approximate mean field treatment the classical vari- 
able n j would then emerge as the unit vector pointing in the direction of (cide A) 
with c ; and c; the electron creation and annihilation operators, respectively. But 
this is not a text on solid state physics. 


First versus second order in time 


Here we would like to derive an effective low energy description of the ferromagnet 
and antiferromagnet in the spirit of the o model description of the preceding 
chapter. Our treatment will be significantly longer than the standard discussion 
given in some field theory texts, but has the slight advantage of being correct. 

The somewhat subtle issue is what kinetic energy term we have to add to 
—H to form the Lagrangian L. Since for a unit vector 7 we have n - (dn/dt) = 
(d(# -n)/dt) =0, we cannot make do with one time derivative. With two deriva- 
tives we can form (dn/dt) - (dn/dt) and so 


an; an; 
pwrong = 2g2 5 ar Fy ~ Ji nj nj (1) 


<ij> 
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A typical field theory text would then pass to the continuum limit and arrive at the 
Lagrangian density 


b= (— SE - 2 YS (2) 


with the constraint [7i(x, tf)? = 1. This is another example of a nonlinear o model. 
Just as in the nonlinear 0 model discussed in Chapter V1.4, the Lagrangian looks 
free, but the nontrivial dynamics comes from the constraint. The constant c, (which 
is determined in terms of the microscopic variable J) is the spin wave velocity, as 
you can see by writing down the equation of motion (87/417) — CVn = 0. 

But you can feel that something is wrong. You learned in a quantum mechanics 
course that the dynamics of a spin variable S is first order in time. Consider the 

most basic example of a spin i in a constant magnetic field described by H = bs - B. 

ThendS /dt=i[FA, S]= = uB x 5. Besides, you might remember from a solid state 
physics course that in a ferromagnet the dispersion relation of the spin wave has 
the nonrelativistic form w ox k and not the relativistic form w* ox k? implied by (2). 

The resolution of this apparent paradox is based on the Pauli-Hopf identity: 
Given a unit vector 7 we can always write i = z'¢z, where z = (=! ) consists of 
two complex numbers such that z'z = ZZ) + 2529 = 1. Verify this! (A mathematical 
aside: Writing z, and z) out in terms of real numbers we see that this defines the 
so-called Hopf map S? — S?.) While we cannot form a term quadratic in # and 
linear in time derivative, we can write a term quadratic in the complex doublet 
z and linear in time derivative. Can you figure it out before looking at the next 
line? 

The correct version of (1) is 


Oz an; on, > ad 
cecorrect = oa ja +55 7 aT ny nj (3) 


The added term is known as the Berry’s phase term and has deep topological 
meaning. You should derive the equation of motion using the identity 


+ 92; 1: _ _ an; 
fas(s iar =i farsi. nj Xa (4) 


Remarkably, although zi (az; /dt) cannot be written simply in terms of 7,, its 
variation can be. 
Low energy modes in the ferromagnet and the antiferromagnet 


In the ground state of a ferromagnet, the magnetic moments all point in the same 
direction, which we can choose to be the z-direction. Expanding the equation 
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of motion in small fluctuations around this ground state n; = é, + dn; (where 
evidently é, denotes the appropriate unit vector) and Fourier transforming, we 


obtain 
2 i 
-S+h(k) iw (Fi) =0 (5) 


linking the two components én, (k) and 5n,(k) of d7(k). The condition i ;- i; = 1 
says that dn,(k) = 0. Here a is the lattice spacing and h(k) = 4J[2 — cos(k,a) — 
cos(kya)]~2J ak? for smallk. (lam implicitly working in two spatial dimensions 
as evidenced by k, and k,.) 

At low frequency the Berry term iw dominates the naive term w*/g”, which we 
can therefore throw away. Setting the determinant of the matrix equal to zero, we 
see that we get the correct quadratic dispersion relation w x k?. 

The treatment of the antiferromagnet is interestingly different. The so-called 
Néel state’ for an antiferromagnet is defined by fi pul 1é,. Writing 7 j= 
(—)/é, +8 n,;, we obtain 


-2; + fk) —siw ( én, (k) )=0 (6) 
fio 5+ fH] Lomk+) 


linking én, and dn, evaluated at different momenta. Here f(k) = 4J 
[2 + cos(k,a) + cos(k,a)] and Q =[z/a, 2/a]. The appearance of Q is due to 
(—1/ =e! 24), (1 will let you figure out the somewhat overly compact notation.) 
The antiferromagnetic factor (—1)/ explicitly breaks translation invariance and 
kicks in the momentum Q whenever it occurs. A similar equation links 6n ,(k) 
and én,(k + Q). Solving these equations, you will find that there is a high fre- 
quency branch that we are not interested in and a low frequency branch with the 
linear dispersion w ox k. 

Thus, the low frequency dynamics of the antiferromagnet can be described by 
the nonlinear o model (2), which when the spin wave velocity is normalized to 1 
can be written in the relativistic form: 


1 = > 
L= 3g2 Hn - on (7) 


‘Note that while the Néel state describes the lowest energy configuration for a classical 
antiferromagnet, it does not describe the ground state of a quantum antiferromagnet. The 
terms S;"S; + S; S7 in the Hamiltonian J 7 _;,. S, - S; flip the spins up and down. 
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Exercises 


VI1.5.1. Work out the two branches of the spin wave spectrum in the ferromagnetic case, 
paying particular attention to the polarization. 


VI.5.2. Verify that in the antiferromagnetic case the Berry’s phase term merely changes 
the spin wave velocity and does not affect the spectrum qualitatively as in the 
ferromagnetic case. 


Chapter VI.6 
Surface Growth and Field Theory 


In this chapter I will discuss a topic, rather unusual for a field theory text, taken 
from non-equilibrium statistical mechanics, one of the hottest growth fields in 
theoretical physics over the last few years. I want to introduce you to yet another 
area in which field theoretic concepts are of use. 

Imagine atoms being deposited randomly on some surface. This is literally how 
some novel materials are grown. The height A(x, t) of the surface as it grows is 
governed by the Kardar-Parisi-Zhang equation 

oh _ wh + A why? + n(x, f) (1) 
ot 2 
This equation describes a deceptively simple prototype of nonequilibrium dynam- 
ics and has a remarkably wide range of applicability. 

To understand (1), consider the various terms on the right-hand side. The term 
vVh (with v > 0) is easy to understand: Positive in the valleys of 4 and negative on 
the peaks, it tends to smooth out the surface. With only this term the problem would 
be linear and hence trivial. The nonlinear term (4./2)(Vh)* renders the problem 
highly nontrivial and interesting; I leave it to you as an exercise to convince 
yourself of the geometric origin of this term. The third term describes the random 
arrival of atoms, with the random variable 7 (x, t) usually assumed to be Gaussian 
distributed, with zero mean,' and correlations 


(n&, On, 1) = 20780 — ¥)5(t — 1') (2) 
In other words, the probability distribution for a particular 7(x, t) is given by 


P(n) « om? f a?xdt nt? 


Here x represents coordinates in D-dimensional space. Experimentally, D = 2 for 
the situation I described, but theoretically we are free to investigate the problem 
for any D. 


'There is no loss of generality here since an additive constant in 7 can be absorbed by 
the shifth > h+ct. 
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Typically, condensed matter physicists are interested in calculating the corre- 
lation between the height of the surface at two different positions in space and 
time: 


+ sae 
(Ih, 1) ~ AG, ’)P) = |e -— PX F (& s ) ; 3) 
The bracket {---) here and in (2) denotes averaging over different realizations 
of the random variable 7(X, t). On the right-hand side of (3) I have written the 
dynamic scaling form typically postulated in condensed matter physics, where x 
and z are the so-called roughness and dynamic exponents. The challenge is then 
to show that the scaling form is correct and to calculate x and z. Note that the 
dynamic exponent z (which in general is not an integer) tells us, roughly speaking, 
how many powers of space is worth one power of time. (For A = 0 we have simple 
diffusion for which z = 2.) Here f denotes an unknown function. 
I will not go into more technical details. Our interest here is to see how this 
problem, which does not even involve quantum mechanics, can be converted into 
a quantum field theory. Start with 


—. Dy x 2 = 
z= { Dh | Dye 2? f dP xdtnG.t)*s E —vWh — =(vhy — nG.n| 
(4) 
Integrating over n, we obtain Z = f Dh e~5 with the action 
Sth) = — | dx dt E —vV7h com] (5) 
~ 202 ar 2 


You will recognize that this describes a nonrelativistic field theory of a scalar field 
h(x, t). The physical quantity we are interested in is then given by 


(G9) 0a NP) = 5 | DheS[ACz, t) — AG’, t’YP (6) 


Thus, the challenge of determing the roughness and dynamic exponents in statis- 
tical physics is equivalent to the problem of determining the propagator 


DG. t= ; J Dh ea (&, tyh@, 0) 


of the scalar field h. 
Incidentally, by scaling t > t/v andh > ./o2/v h, we can write the action as 
I D a 2 & 2 ? 
sin => fa xdt|{——V*}h—=(Wh 7 
i= f arear|(2—v2) n— Econ] a 


with g = A*o0?/v°. Expanding the action in powers of h as usual 
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_1f wpe a o\,F_ (2 - *) ps4 
S(h) = 5 fa x dt {[( Vv Ja] g(VhA) 7 VJ A+ 4 (Vh)"}, 
(8) 


we recognize the quadratic term as giving us the rather unusual propagator 
1/(w* + k*) for the scalar field h, and the cubic and quartic term as describing 
the interaction. As always, to calculate the desired physical quantity we evaluate 
the functional or “path” integral 


7= | Dh s+ f dxdtJ (x,th(x,t) 


and then functionally differentiate repeatedly with respect to J. 

My intent here is not so much to teach you nonequilibrium statistical mechanics 
as to show you that quantum field theory can emerge in a variety of physical 
situations, including those involving only purely classical physics. Note that the 
“quantum fluctuations” here arise from the random driving term. Evidently, there 
is a close methodological connection between random dynamics and quantum 
physics. 


Exercises 


VI.6.1. An exercise in elementary geometry: Draw a straight line tilted at an angle 6 with 
respect to the horizontal. The line represents a segment of the surface at time f. 
Now draw anumber of circles of diameter d tangent to and on top of this line. Next 
draw another line tilted at angle @ with respect to the horizontal and lying on top 
of the circles, namely tangent to them. This new line represents a segment of the 
surface some time later (see Fig. VI.6.1). Note that Ah = d/cos 6 ~ d(1+ 467). 
Show that this generates the nonlinear term (A. /2)(Vh)? in the KPZ equation (1). 


Figure VI.6.1 


VI.6.2. 


V1.6.3. 


VIL6. Surface Growth and Field Theory 329 


For applications of the KPZ equation, see for example, T. Halpin—Healy and Y.- 
C. Zhang, Phys. Rep, 254: 215, 1995; A. L. Barabasi and H. E. Stanley, Fractal 
Concepts in Surface Growth. 


Show that the scalar field A has the propagator 1/(w? + k*). 


Field theory can often be cast into apparently rather different forms by a change 
of variable. Show that by writing U = 28" wecan change the action (7) to 


2 Dz 19 “tgp 
S= d% at (u—' Lu —u-'vu (9) 
2 
g ot 


a kind of nonlinear o model. 


Chapter V1.7 


Disorder: Replicas and 
Grassmannian Symmetry 


Impurities and random potential 


An important area in condensed matter physics involves the study of disordered 
systems, a subject that has been the focus of a tremendous amount of theoretical 
work over the last few decades. Electrons in real materials scatter off the impurities 
inevitably present and effectively move in a random potential. In the spirit of this 
book I will give you a brief introduction to this fascinating subject, showing how 
the problem can be mapped into a quantum field theory. 

The prototype problem is that of a quantum particle obeying the Schrédinger 
equation Hy = [~V? + V(x)]w = Ey, where V (x) is arandom potential (repre- 
senting the impurities) generated with the Gaussian white noise probability distri- 
bution P(V) = Ne > dP x(1/2g?)V (x) with the normalization factor N determined 
by f DV P(V) =1. The parameter g measures the strength of the impurities: the 
larger g, the more disordered the system. This of course represents an idealization 
in which interaction between electrons and a number of other physical effects are 
neglected. 

As in statistical mechanics we think of an ensemble of systems each of which 
is characterized by a particular function V (x) taken from the distribution P(V). 
We study the average or typical properties of the system. In particular, we might 
want to know the averaged density of states defined by p(F) = (tr 5(E — H)) = 
(>>, 5(E — E;)), where the sum runs over the ith eigenstate of H with correspond- 
ing eigenvalue E;. We denote by (O(V)) = { DV P(V) O(YV) the average of any 
functional O(V) of V (x). Clearly, Tee dE p(E) counts the number of states in 
the interval from E* to E* + SF, an important quantity in, for example, tunneling 
experiments. 


Anderson localization 
Another important physical question is whether the wave functions at a particu- 
lar energy E extend over the entire system or are localized within a characteristic 


length scale §(F). Clearly, this issue determines whether the material is a conduc- 
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tor or an insulator. At first sight, you might think that we should study 


S(x,y; C) = (>) 8(E — Evi @)¥,0)) 


which might tell us how the wave function at x is correlated with the wave function 
at some other point y, but § unsuitable because y*(x)y;(y) has a phase that 
depends on V. Thus, S would vanish when averaged over disorder. Instead, the 
correct quantity to study is 


K(x ~ ys E)= (0 (EE — EQW vy, 0) V7 O)Y;,@)) 


since y*(x)¥;(y)¥(y)¥;(x) is manifestly positive. Note that upon averaging 
over all possible V (x) we recover translation invariance so that K is a function of 
the separation |x — y| only. As jx — y| > oo, if K(x — y; E) ~ e795) the 
wave functions around the energy E are localized over the so-called localization 
length €(£). On the other hand, if K (x — y; E’) decreases as a power law of |x — y|, 
the wave functions are said to be extended. 

Anderson and his collaborators made the surprising discovery that localization 
properties depend on D, the dimension of space, but not on the detailed form of 
P(V) (anexample of the notion of universality). For D = 1 and 2 all wave functions 
are localized, regardless of how weak the impurity potential might be. This is a 
highly nontrivial statement since a priori you might think, as eminent physicists 
did at the time, that whether the wave functions are localized or not depends on the 
strength of the potential. In contrast, for D = 3, the wave functions are extended for 
E in the range (—E,,, E..). As E approaches the energy E,. (known as the mobility 
edge) from above, the localization length €(F) diverges as €(£) ~ 1/(E — E,)* 
with some critical esponent! 2. Anderson received the Nobel Prize for this work 
and for other contributions to condensed matter theory. 

Physically, localization is due to destructive interference between the quantum 
waves scattering off the random potential. 

When a magnetic field is turned on perpendicular to the plane of a D = 2 elec- 
tron gas the situation changes dramatically: An extended wave function appears 
at E = 0. For non-zero E, all wave functions are still localized, but with the lo- 
calization length diverging as (FE) ~ 1/|E|”. This accounts for one of the most 
striking features of the quantum Hall effect (see Chapter VI.2): The Hall conduc- 
tivity stays constant as the Fermi energy increases but then suddenly jumps by a 
discrete amount due to the contribution of the extended state as the Fermi energy 
passes through EF = 0. Understanding this behavior quantitatively poses a major 


f 


'This is an example of a quantum phase transition. The entire discussion is at zero 
temperature. In contrast to the phase transition discussed in Chapter V.3, here we vary E 
instead of the temperature. 
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challenge for condensed matter theorists. Indeed, many consider an analytic cal- 
culation of the critical exponent v as one of the “Holy Grails” of condensed matter 
theory. 


Green’s function formalism 


So much for a lightning glimpse of localization theory. Fascinating though the 
localization transition might be, what does quantum field theory have to do with 
it? This is after all a field theory text. Before proceeding we need a bit of formalism. 
Consider the so-called Green’s function G(z) = (tr[1/(z — H)}) in the complex z- 
plane. Since tr[1/(z — H)]= )-, 1/(z — E;), this function consists of a sum of 
poles at the eigenvalues FE. Upon averaging, the poles merge into a cut. Using the 
standard identity lim Im[1/(x + ie)] = —imd(x), we see that 


o(E)=—+ lim Im G(E + ie) (1) 
Tw e500 


So if we Know G(z) we know the density of states. 


The infamous denominator 


I can now explain how quantum field theory enters into the problem. We start by 
taking the logarithm of the identity (A.15) 


JK y= log f Do' Dye 9K 9+ I 949" J) 


(where as usual we have dropped an irrelevant term). Differentiating with respect 
to J‘ and J and then setting J* and J equal to 0 we obtain an integral representation 
for the inverse of a hermitean matrix: 


—ol-K. 
[ Do Doe #"* ¥g,9' 


K-,, = ———_—_ 
( dij f Det Dge-#' K-¥ 


(2) 
(Incidentally, you may recognize this as essentially related to the formula (1.7.14) 
for the propagator of a scalar field.) Now that we know how to represent 1/(z — H) 
we have to take its trace, which means setting i = j in (2) and summing. In our 
problem, H = —V* + V(x) and the index i corresponds to the continuous variable 
x and the summation to an integration over space. Replacing K by i(z — H) (and 
taking care of the appropriate delta function) we obtain 


tr 


. if d?x{agtag+{V(x)—zlo! 
= | dy f Dot Dye J Og 9HV)~z'P Oy) gt (y) 


= 3 
z-H f Dot Doe! Jf 4?10¢t8e+1V@)—zlv'9} ©) 
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This is starting to look like a scalar field theory in D-dimensional Euclidean space 
with the action S = f d?x{ag'dg + [V(x) — zlp'g}. [Note that for (3) to be well 
defined z has to be in the lower half-plane. } 

But now we have to average over V (x), that is, integrate over V with the prob- 
ability distribution P(V). We immediately run into the difficulty that confounded 
theorists for a long time. The denominator in (3) stops us cold: If that denominator 
were not there, then the functional integration over V (x) would just be the Gaus- 
sian integral you have learned to do over and over again. Can we somehow lift this 
infamous denominator into the numerator, so to speak? Clever minds have come 
up with two tricks, known as the replica method and the supersymmetric method, 
respectively. If you can come up with another trick, fame and fortune might be 
yours. 


Replicas 


The replica trick is based on the well-known identity (1/x) = im x"—!) which 
n> 
allows us to write that much disliked denominator as 


lim (| Dot Dye J Pxe'ae+1V@—zlelehyn—| 


n—70 


= = lim sf I Do! Dg,e if ax Yr _18¢48 aH LV x) —2lei 4a) 
Thus (3) becomes 


tr = tim i f ay 
z—H n-0 


n 
: D 
J (11 bebe, e fa x» _ {aed dg. 1V (x)—- zlply,) 2; (y) gf (y) (4) 


a=1 


Note that the functional integral is now over n complex scalar fields g,. The field 
gy has been replicated. 

Averaging over the random potential, we recover translation invariance; thus the 
integrand for { d? y does not depend on y and f{ d? y just produces the volume V 
of the system. Using (A.13) we obtain 


= iV lim | (II Dei De. e fax£o Ot) (5) 


Z7 nO 


(tr 


where 


2 
L(g) = Sou! 80, — 20'@a) 48 E (yo ol.) (6) 


a=1 a=] 
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We obtain a field theory (with a peculiar factor of i) of n scalar fields with a good 
old g* interaction invariant under O(n) (known as the replica symmetry.) Note 
the wisdom of replacing K by i(z — H); if we didn’t include the i the functional 
integral would diverge at large g, as you can easily check. For z in the upper half- 
plane we would replace K by —i(z — H). The quantity from which we can extract 
the desired averaged density of states is given by the propagator of the scalar field. 
Incidentally, we can replace 9, 0)¢; (0) in (5) by the more symmetric expression 
(U/n) Dy-1 P5Po- 

Absorbing V so that we are calculating the density of states per unit volume, 
we find 


G() =i lim | (r bebe.) e 5) (2 > eh0ox0 (7) 
=] b=] 


For any integer 7 the field theory is perfectly well defined, so the delicate step 
in the replica approach is in taking the n — 0 limit. There is a fascinating literature 
on this limit. (Consult a book devoted to spin glasses.) 

Some particle theorists used to speak disparagingly of condensed matter physics 
as dirt physics, and indeed the influence of impurities and disorder on matter is 
one of the central concerns of modern condensed matter physics. But as we see 
from this example, in many respects there is no mathematical difference between 
averaging Over randomness and summing over quantum fluctuations. We end 
up with a g* field theory of the type that many particle theorists have devoted 
considerable effort to studying in the past. Furthermore, Anderson’s surprising 
result that for D = 2 any amount of disorder, no matter how small, localizes all 
states means that we have to understand the field theory defined by (6) in a highly 
nontrivial way. The strength of the disorder shows up as the coupling g”, so no 
amount of perturbation theory in g* can help us understand localization. Anderson 
localization is an intrinsically nonperturbative effect. 


Grassmannian approach 


As I mentioned earlier, people have dreamed up not one, but two, tricks in dealing 
with the nasty denominator. The second trick is based on what we learned in Chap- 
ter II.5 on integration over Grassmann variables: Let (x) and 7(x) be Grassmann 
fields, then 


_ me i, \7! 
| DnDiae J PK" C det K =C (f DoDgie J 9 *) 


where C and C are two uninteresting constants that we can absorb into the defini- 
tion of D7 D7. With this identity we can write (3) as 
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1 
tr =i | d? 
z—-H | y 


J Dot DoDnDige J 4?#(80'8041V )—ziol oh H(979NHLV (2)—zHin 


o(y)e'(y) 


(8) 


and then easily average over the disorder to obtain 


(« )=! / Dot DoDnDige J PLO78 Mogt(x) (9) 
z — 
with 
= on. ot t 7 to + any + © tg + Any? 
L(y. n. g', g) = 9g! dg + ANON — z(g'o +N) + 5 g+nn) (10) 


We end up with a field theory with bosonic (commuting) fields yg! and g and 
fermionic (anticommuting) fields 7 and n interacting with a strength determined 
by the disorder. The action S exhibits an obvious symmetry rotating bosonic fields 
into fermionic fields and vice versa, and hence this approach is known in the 
condensed matter physics community as the supersymmetric method. (It is perhaps 
worth emphasizing that 7 and 7 are not spinor fields, which we underline by not 
writing them as y and yw. The supersymmetry here, perhaps better referred to 
as Grassmannian symmetry, is quite different from the supersymmetry in particle 
physics to be discussed in Chapter VHI.4.) 

Both the replica and the supersymmetry approaches have their difficulties, and 
I was not kidding when I said that if you manage to invent a new approach without 
some of these difficulties it will be met with considerable excitement by condensed 
matter physicists. 


Probing localization 


I have shown you how to calculate the averaged density of states o(E). How do 
we study localization? I will let you develop the answer in an exercise. From our 
earlier discussion it should be clear that we have to study an object obtained from 
(3) by replacing y(y)¢" (y) by e(y¢'(y)e(~)e"(y’). If we choose to think of the 
replica field theory in the language of particle physics as describing the interaction 
of some scalar meson, then rather pleasingly, we see that the density of states is 
determined by the meson propagator and localization is determined by meson- 
meson scattering. 


Exercises 


VL7.1. Work out the field theory that will allow you to study Anderson localization. {Hint: 
Consider the object 
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(2p) (eta) 


for two Complex numbers z and w. You will have to introduce two sets of replica 
fields, commonly denoted by y+ and g7.] {Notation: [1/(z — H) (x, y) denotes 
the xy element of the matrix or operator [1/(z — H)].} 


As another example from the literature on disorder, consider the following prob- 
lem. Place N points randomly in a D-dimensional Euclidean space of volume V. 
Denote the locations of the points by x; G =1,..., MN). Let 


pik 
f@=() [a aire 


Consider the N by N matrix H,, = f (%; —%;) . Calculate p(£), the density 
of eigenvalues of H as we average over the ensemble of matrices, in the limit 
N — oo, V — ov, with the density of points o =N/V (not to be confused with 
p(£) of course) held fixed. [Hint: Use the replica method and arrive at the field 
theory action 


At 
S(@) = | dx [ave + m*|gq\") — pe Eat 
a=! 


This problem is not entirely trivial; if you need help consult M. Mézard et al., 
Nucl. Phy. B559: 689, 2000, cond-mat/9906135. 


Chapter V1.8 


Renormalization Group Flow as a 
Natural Concept in High Energy and 
Condensed Matter Physics 


Therefore, conclusions based on the renormalization group arguments . . . 
are dangerous and must be viewed with due caution. So is it with all 
conclusions from local relativistic field theories. 


—J. Bjorken and 8. Drell, 1965 


It is not dangerous 


The renormalization group represents the most important conceptual advance in 
quantum field theory over the last three or four decades. The basic ideas were 
developed simultaneously in both the high energy and condensed matter phys- 
ics communities, and in some areas of research renormalization group flow has 
become part of the working language. 

As you can easily imagine, this is an immensely rich and multifaceted subject, 
which we can discuss from many different points of view, and a full exposition 
would require a book in itself. Unfortunately, there has never been a completely 
satisfactory and comprehensive treatment of the subject. The discussions in some 
of the older books are downright misleading and confused, such as the well-known 
text from which I learned quantum field theory and from which the quote above 
was taken. In the limited space available here, I will attempt to give you a flavor 
of the subject rather than all the possible technical details. I will first approach it 
from the point of view of high energy physics and then from that of condensed 
matter physics. As ever, the emphasis will be on the conceptual rather than the 
computational. As you will see, in spite of the order of my presentation, it is easier 
to grasp the role of the renormalization group in condensed matter physics than in 
high energy physics. 

I laid the foundation for the renormalization group in Chapter III.1—I do plan 
ahead! Let us go back to our experimentalist friend with whom we were discussing 
Ap" theory. We will continue to pretend that our world is described by a simple 
Ag" theory and that an approximation to order i suffices. 
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What experimentalists insist on 


Our experimentalist friend was not interested in the coupling constant A we wrote 
down on a piece of paper, a mere Greek letter to her. She insisted that she would 
accept only quantities she and her experimental colleagues can actually measure, 
even if only in principle. As a result of our discussion with her we sharpened our 
understanding of what a coupling constant is and learned that we should define a 
physical coupling constant by [see (III.1.4)] 


Ap(“) =A — 307? log (7 >) +008) (1) 
pe 


At her insistence, we learned to express our result for physical amplitudes in terms 
of 4 p(j), and not in terms of the theoretical construct A. In particular, we should 
write the meson-meson scattering amplitude as 


2 
M = —iAp(u) +iCAp(u) foe ( ~) 10g (4 ~) + 10g (4 ) 


+ O[Ap(t)’] (2) 


What is the physical significance of A p(jz)? To be sure, it measures the strength 
of the interaction between mesons as reflected in (2). But why one particular 
choice of 2? Clearly, from (2) we see that 1 p(jz) is particularly convenient for 
studying physics in the regime in which the kinematic variables s, t, and w are all 
of order jz”. The scattering amplitude is given by —iA p(z) plus small logarithmic 
corrections, (Recall from a footnote in Chapter III.3 that the renormalization point 
Sp = to = Ug = EL” is adopted purely for theoretical convenience and cannot be 
reached in actual experiments. For our conceptual understanding here this is not a 
relevant issue.) In short, A p(jz) is known as the coupling constant appropriate for 
physics at the energy scale yz. 

In contrast, if we were so idiotic as to use the coupling constant A p(2’) while 
exploring physics in the regime with s, t, and u of order jz, with yz vastly different 
from yt’, then we would have a scattering amplitude 


| p? pe? 
M = —idp(u) +iCA p(w’)? foe (4 =) + 10g (4 ) +108 (-)| 
+ OfAp(w’)’] (3) 


in which the second term [with log(”/ 2”) large] can be comparable to or larger 
than the first term. Thus, for each energy scale jz there is an “appropriate” coupling 
constant A p(j2). 

Subtracting (2) from (3) we can easily relate A p(t) and A p(w’) for u~ p’: 


Ap(u’') =Ap(u) + 3CA p(n)” log (4 ) + O[Ap(u)’] (4) 
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We can express this as a differential “flow equation” 
4 1 (yu) =6 2 3 5 
Mn p(t) = 6CA p(u)° + O(CA;) (5) 


As you have already seen repeatedly, quantum field theory is full of historical 
misnomers. The description of how A p(j4) changes with jz is known as the renor- 
malization group. The only appearance of a group concept here is the additive 
group of transformation 4 > jz + dy. 

For the conceptual discussion in Chapter III.1 and here, we don’t need to know 
what the constant C happens to be. If C happens to be negative, then the coupling 
\ p(t) will decrease as the energy scale jz increases, and the opposite will occur 
if C happens to be positive. (In fact, the sign is positive, so that as we increase the 
energy scale, Ap flows away from the origin.) 


Flow of the electromagnetic coupling 


The behavior of A is typical of coupling constants in 4-dimensional quantum field 
theories. For example, in quantum electrodynamics, the coupling e or equivalently 
a = e*/42, measures the strength of the electromagnetic interaction. The story is 
exactly as that told for the Ag* theory: Our experimentalist friend is not interested 
in the Latin letter e, but wants to know the actual interaction when the relevant 
momenta squared are of the order ”. Happily, we have already done the compu- 
tation: We can read off the effective coupling at momentum transferred squared 
gq? = ye” from (III.7.13): 


ep(u)’ =e" ~e*[1— e’ Mu’) + Oe) 


1+ e?M (pu?) 


Take yz much larger than the electron mass m but much smaller than the cutoff 
mass M. Then from (HI.7.13) 


e,+0e) (6) 


d 1 d 
w—ep() = ~ 30g MH) + O@)=+ 


du 2 12? 


We learn that the electromagnetic coupling increases as the energy scale in- 
creases. Electromagnetism becomes stronger as we go to higher energies, or equiv- 
alently shorter distances. 

Physically, the origin of this phenomenon is closely related to the physics of 
dielectrics. Consider a photon interacting with an electron, which we will call the 
test electron to avoid confusion in what follows. Due to quantum fluctuations, 
as described way back in Chapter I.1, spacetime is full of electron-positron pairs, 
popping in and out of existence. Near the test electron, the electrons in these virtual 
pairs are repelled by the test electron and thus tend to move away from the test 
electron while the positrons tend to move toward the test electron. Thus, at long 
distances, the charge of the test electron is shielded to some extent by the cloud 
of positrons, causing a weaker coupling to the photon, while at short distances the 
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coupling to the photon becomes stronger. The quantum vacuum is just as much a 
dielectric as a lump of actual material. 

You may have noticed by now that the very name “coupling constant” is a 
terrible misnomer due to the fact that historically much of physics w. . done at 
essentially one energy scale, namely “almost zero”! In particular, people speak of 
the fine structure “constant” @ = 1/137 and crackpots continue to try to “derive” 
the number 137 from numerology or some fancier method. In fact, a is merely the 
coupling “constant” of the electromagnetic interaction at very low energies. It is 
an experimental fact that a, more properly written as a p() = e%, (4) /42, varies 
with the energy scale yz we are exploring. But alas, we are probably stuck with the 
name “coupling constant.” 


Renormalization group flow 


In general, in a quantum field theory with a coupling constant g, we have the 
renormalization group flow equation 


p28 = Beg) (7) 
dy 


which is sometimes written as dg/dt = B(g) upon defining ¢ = log(jt/ 9). I will 
now suppress the subscript P on physical coupling constants. If the theory happens 
to have several coupling constants g,;,i = 1, -.--, N, then we have 


dg. 
— = fi(g1,°°*, en) (8) 


We can think of (¢1, ---, gy) as the coordinate of a particle in N-dimensicnal 
space, t as time, and £;(g1, ---. gy) a position dependent velocity field. As we 
increase 44 or t we would like to study how the particle moves or flows. For 
notational simplicity, we will now denote (g1, --- , gx) collectively as g. Clearly, 
those couplings at which £;(g*) (for all i) happen to vanish are of particular 
interest: g* is known as a fixed point. If the velocity field around a fixed point 
g* is such that the particle moves toward that point (and once reaching it stays 
there since its velocity is now zero) the fixed point is known as attractive or stable. 
Thus, to study the asymptotic behavior of a quantum field theory at high energies 
we “merely” have to find all its attractive fixed points under the renormalization 
group fiow. In a given theory, we can typically see that some couplings are flowing 
toward larger values while others are flowing toward zero. 

Unfortunately, this wonderful theoretical picture is difficult to implement in 
practice because we essentially have no way of calculating the functions £,(g). In 
particular, g* could well be quite large, associated with what is known as a strong 
coupling fixed point, and perturbation theory and Feynman diagrams are of no use 
in determining the properties of the theory there. Indeed, we know the fixed point 
structure of very few theories. 
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Happily, we know of one particularly simple fixed point, namely g* = 0, at 
which perturbation theory | is certainly applicable. We can always evaluate (8) 
perturbatively: dg; /dt = c} Ke j8kt dt 8; j8x81 +: --- (In some theories, the series 
starts with quadratic terms and in others, with cubic terms. Sometimes there is 
also a linear term.) Thus, as we have already seen in a couple of examples, the 
asymptotic or high energy behavior of the theory depends on the sign of f; in (8). 

Let us now join the film “Physics History” already in progress. In the late 1960s, 
experimentalists studying the so-called deep inelastic scattering of electrons on 
protons discovered that their data seemed to indicate that after being hit by a 
highly energetic electron, one of the quarks inside the proton would propagate 
freely without interacting strongly with the other quarks. Normally, of course, 
the three quarks inside the proton are strongly bound to each other to form the 
proton. Eventually, a few theorists realized that this puzzling state of affairs could 
be explained if the theory of strong interaction is such that the coupling flows 
toward the fixed point g* = 0. If so, then the strong interaction between quarks 
would actually weaken at higher and higher energy scales. 

All of this is of course now “obvious” with the benefit of hindsight, but dear 
students, remember that at that time field theory was pronounced as possibly 
unsuitable for young minds and the renormalizable group was considered “dan- 
gerous” even in a field theory text! 

The theory of strong interaction was unknown. But if we were so bold as to 
accept the dangerous renormalization group ideas then we might even find the 
theory of the strong interaction by searching for asymptotically free theories, which 
is what theories with an attractive fixed point at g* = 0 became known as. 

Asymptotically free theories are clearly wonderful. Their behavior at high ener- 
gies can be studied using perturbative methods. And so in this way the fundamental 
theory of the strong interaction, now known as quantum chromodynamics, about 
which more later, was found. 


Looking at physics on different length scales 
The need for renormalization groups is really transparent in condensed matter 
physics. Instead of generalities, let me focus on a particularly clear example, 
namely surface growth. Indeed, that was why I chose to introduce the Kardar- 


Parisi-Zhang equation in Chapter VI.6. We learned that to study surface growth 
we have to evaluate the functional or path integral 


Z(A) = | Dhe A, (9) 


with, you will recall, 


2 
S(h) = — -/ ax dt (= Oh py £ vn?) , (10) 
at 2 
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(a) 


(b) 


Figure V1.8.1 


This defines a field theory. As with any field theory and as I indicate, a cutoff A 
has to be introduced. We integrate over only those field configurations h(x, t) that 
do not contain Fourier components with Kk and w larger than A. (In principle, since 
this is a nonrelativistic theory we should have different cutoffs for k and for w, but 
for simplicity of exposition let us just refer to them together generically as A.) The 
appearance of the cutoff is completely physical and necessary. At the very least, 
on length scales comparable to the size of the relevant molecules, the continuum 
description in terms of the field A(x, t) has long since broken down. 

Physically, since the random driving term n(x, ft) is a white noise, that is, 7 
at X and at x’ (and also at different times) are not correlated at all, we expect the 
surface to look very uneven on a microscopic scale, as depicted in Figure VI.8. 1a. 
But suppose we are not interested in the detailed microscopic structure, but more 
in how the surface behaves on a larger scale. In other words, we are content to 
put on blurry glasses so that the surface appears as in Figure VI.8.1b. This is a 
completely natural way to study a physical system, one that we are totally familiar 
with in studying physics. We may be interested in physics over some length scale 
L and do not care about what happens on length scales much less than L. 

The renormalization group is the formalism that allows us to relate the physics 
on different length scales or, equivalently, physics on different energy scales. In 
condensed matter physics, one tends to think of length scales, and in particle 
physics, of energy scales. The modern approach to renormalization groups came 
out of the study of critical phenomena by Kadanoff, Fisher, Wilson, and others, 
as mentioned in Chapter V.3. Consider, for example, the Ising model, with the 
spin at each site either up or down and with a ferromagnetic interaction between 
neighboring spins. At high temperatures, the spins are randomly up and down. As 
the temperature drops toward the ferromagnetic transition point, islands of up spins 
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(we say up spins to be definite, we could just as easily talk of down spins) start 
to appear. They grow ever larger in size until the critical temperature 7, at which 
all the spins in the entire system point up. The characteristic length scale of the 
physics at any particular temperature is given by the typical size of the islands. The 
physically motivated block spin method of Kadanoff et al. treats blocks of up spin 
as one single effective up spin, and similarly for blocks of down spins. The notion 
of a renormalization group is then the natural one for describing these effective 
spins by an effective Hamiltonian appropriate to that length scale. 

It is more or less clear how to implement this physical idea of changing length 
scales in the functional integral (9). We are supposed to integrate over those 
h(k, w), with k and w less than A. Suppose we do only a fraction of what we 
are supposed to do. Let us integrate over those h(k, w) with k and w larger than 
A — 5A but smaller than A. This is precisely what we mean when we say that 
we don’t care about the fluctuations of h(x, t) on length and time scales less than 
(A —8A)71. 


Putting on blurry glasses 


For the sake of simplicity, let us go back to our favorite, the Ag* theory, instead 
of the surface growth problem. Recall from the preceding chapters the importance 
of the Euclidean Ag* theory in modern condensed matter theory. So, continue the 
i.9* theory to Euclidean space and stare at the integral 


Z(A) = J Dye J xt) (1) 
A 


The notation {, instructs us to integrate over only those field configurations 
g(x) = f[d4k/(2n)e** o(k) such that y(k) = 0 for |kl = Oo, K2)2 larger than 
A. As explained in the text this amounts to putting on blurry glasses with resolution 
L = 1/A: We do not admit or see fluctuations with length scales less than L. 

Evidently, the O(d) invariance, namely the Euclidean equivalent of Lorentz 
invariance, will make our lives considerably easier. In contrast, for the surface 
growth problem we will need special glasses that blur space and time differently.! 

We are now ready to make our glasses blurrier by letting A > A — 5A (with 
5A > 0). Write g = g, + ¢,, (s for “smooth” and w for “wriggly’’), defined such 
that the Fourier components g, (k) and ¢,, (k) are nonzero only for |k] < (A — 5A) 
and (A — 6A) < |k] < A, respectively. (Obviously, the designation “smooth” and 
“wriggly” is for convenience.) Plugging into (11) we can write 


‘In condensed matter physics, the so-called dynamical exponent z measures this differ- 
ence. More precisely, in the context of the surface growth problem, the correlator (intro- 
duced in Chapter VI.6) satisfies the dynamic scaling form given in (VI.6.3). Naively, the 
dynamical exponent z should be 2. (For a brief review of all this, see M. Kardar and A. Zee, 
Nucl. Phys. B464[FS]: 449, 1996, cond-mat/9507 112.) 
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Z(A) _— J Dy,e f d27xQ@) J Do, ,e- f 4xL (Gs ,Pw) (12) 
A-6A 


where all the terms in £,(9,, 9,,) depend on g,,. (What we are doing here is 
somewhat reminiscent of what we did in Chapter IV.3.) Imagine doing the integral 
over ¢,,. Call the result 


on f ax8L@,) = J Do,e7 J 2x10 
and thus we have 


Z(A) =| SA Dg,e- f d4x1£ 95) +5£ 95)] (13) 


There, we have done it! We have rewritten the theory in terms of the “smooth” 
field ¢,. 

Of course, this is all formal, since in practice the integral over ~,, can only be 
done perturbatively assuming that the relevant couplings are small. If we could do 
the integral over y,,, exactly, we might as well just do the integral over g and then 
we would have no need for all this renormalization group stuff. 

For pedagogical purposes, consider more generally £ = 4(8y)* + >, 4,9" + 
-. + (so that A, is the usual 4m? and A, the usual A.) Since terms such as 39,09, 
integrate to zero, we have 


l 1 
| d*xLi(9,, ) = | d®x5 @ Go) + 5m Fy +:- | 


which describes a field g,, interacting with both itself and a background field 
”;(x). By symmetry considerations 5£(g,) has the same form as &(g,) but with 
different coefficients. Adding 5£(y,) to £(@,) thus shifts* the couplings 4,, [and 
the coefficient of 4 (8,)7.] These shifts generate the flow in the space of couplings 
I described earlier. 

We could have perfectly well left (13) as our end result. But suppose we want 
to compare (13) with (11). Then we would like to change the fran in (13) to fa ; 
For convenience, introduce the real number b < lby A —5A =6A.In f A—sa WE 
are told to integrate over fields with |k| < bA.So all we have to do is make a trivial 
change of variable: Let k = bk’ so that |k’| < A. But then correspondingly we have 
to change x = x’/b so that e'** = e*’*", Plugging in, we obtain 


d7xL d= fat p-4 | 1 e299)? AG te: 14 
| xL£ (Ys) x [eer 4D gy" + (14) 


2 Terms such as (8¢)* can also be generated and that is why I wrote £ (@) with the (- - -) 
under which terms such as these can be swept. You can check later that for most applications 
these terms are irrelevant in the technical sense to be defined below. 
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(where 3’ = 3/dx’ = (1/b)3/ax). Define w’ by b*-4(8’¢,)* = (a’y’)* or in other 
words gy’ = b2 2-4)». Then (14) becomes 


1 
fate ow + > Ab at e/2VG-2) gn 4. | 
it 
Thus, if we define the coefficient of g” as 4), we have 
N= pinl2yd-2)-4y (15) 


an important result in renormalization group theory. 


Relevant, irrelevant, and marginal 


Let us absorb what this means. (For the time being, let us ignore 6£(¢,) to keep the 
discussion simple.) As we put on blurrier glasses, in other words, as we become 
interested in physics over longer distance scales, we can once again write Z(A) as 
in (11) except that the couplings A,, have to be replaced by 4} . Since b < 1 we see 
from (15) that the A,,’s with (1/2)(d — 2) — d > 0 get smaller and smaller and can 
eventually be neglected. A dose of jargon here: The corresponding operators g” 
(for historical reasons we revert for an instant from the functional integral language 
to the operator language) are called irrelevant. They are the losers. Conversely, the 
winners, namely the y”’s for which (n/2)(d — 2) — d < 0, are called relevant. 
Operators for which (n/2)(d — 2) ~ d =0 are called marginal. 

For example, take n = 2: m” = b~*m* and the mass term is always relevant 
in any dimension. On the other hand, take n = 4, and we see that A’ = b4-4) and 
yg’ is relevant for d < 4, irrelevant for d > 4, and marginal at d = 4. Similarly, 
A‘ = b*4-©4 and g® is marginal at d = 3 and becomes irrelevant for d > 3. 

We also see that d = 2 is special: All the y”’s are relevant. 

Now all this may ring a bell if you did the exercises religiously. In Exercise 
III.2.1 you showed that the coupling A,, has mass dimension [A,,] = (n/2)(2 — 
d) + d. Thus, the quantity (n/2)(d — 2) — d is just the length dimension of A,,. 
For example, for d = 4, 14 has mass dimension —2 and thus as explained in Chapter 
Ill.2 the y® interaction is nonrenormalizable, namely that it has nasty behavior at 
high energy. But condensed matter physicists are interested in the long distance 
limit, the opposite limit from the one that interests particle physicsts. Thus, it is 
the nasty guys like g° that become irrelevant in the long distance limit. 

One more piece of jargon: Given a scalar field theory, the dimension d at 
which the most relevant interaction becomes marginal is known as the critical 
dimension in condensed matter physics. For example, the critical dimension for a 
y® theory is 3. It is now just a matter of “high school arithmetic” to translate (15) 
into differential form. Write A) =A, + 6A,; then from b = 1 — (SA/A) we have 
bay = —[2(d — 2) — dA, (6A/A). 

Let us now be extra careful about signs. As I have already remarked, for 
(n/2)(d — 2) — d > 0 the coupling A,, (which, for definiteness, we will think of 
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as positive) get smaller, as is evident from (15). But since we are decreasing A to 
A — 6A, apositive 6A actually corresponds to the resolution of our blurry glasses 
L = A~' increasing to L + L(6A/A). Thus we obtain 


Ahn Mee my 
L_ = EK 2) a] iy (16) 


so that for (n/2)(d — 2) — d > 0 a positive A,, would decrease as L increases? 

In particular, for n = 4, L(dA/dL) = (4 — d)d. In most condensed matter 
physics applications, d < 3 and so A increases as the length scale of the physics 
under study increases. The ¢* coupling is relevant as noted above. 

The 6£(g,), which we provisionally neglected, contributes an additional 
term, which we call dynamical in contrast to the geometrical or “trivial’’ term dis- 
played, to the right-hand side of (16). Thus, in general L(dA,/dL) = 
—[(n/2)(4 — 2) —d]A, + K(d,n,-++,A;,+++), with the dynamical term K de- 
pending not only on d and n, but also on all the other couplings. [For example, in 
(5) the “trivial” term vanishes since we are in 4—dimensional spacetime; there is 
only a dynamical contribution. ] 

As you can see from this discussion, a more descriptive name for the renormal- 
ization group might be “the trick of doing an integral a little bit at a time.” 


Exploiting symmetry 


To determine the renormalization group flow of the coupling g we can repeat the 
same type of computation we did to determine the flow of the coupling A and of e 
in our two previous examples, namely we would calculate, to use the language of 
particle physics, the amplitude for h-h scattering to one loop order, But instead, 
let us follow the physical picture of Kadanoff et al. In Z(A) = f Dh eS we 
integrate over only those h (k, ) with & and w larger than A — 6A but less than A. 

I will now show you how to exploit the symmetry of the problem to minimize 
our labor. The important thing is not necessarily to learn about the dynamics of 
surface growth, but to learn the methodology that will serve you well in other 
situations. I have picked a particularly “difficult” nonrelativistic problem whose 
symmetries are not manifest, so that if you master the renormalization group for 
this problem you will be ready for almost anything. ; 

Imagine having done this partial integration and call the result f Dh e~5™). At 
this point you should work out the symmetries of the problem as indicated in the 
exercises. Then you can argue that S(h) must have the form 


(ny 1 | a? 8 put) nw Scvay?] 4... 
smy=5 fa x ai|(as pv’) h af cvny?] + , a7 


t 


3 Note that what appears on the right-hand side is minus the length dimension of Ans Not 
the length dimension (n/2)(d — 2) — d as one might have guessed naively. 
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depending on two parameters a and f. The (- - -) indicates terms involving higher 
powers of h and its derivatives. The simplifying observation is that the same 
coefficient a multiplies both h/t and (g/2)(Vh)*. Once we know o and B 
then by suitable rescaling we can bring the action $(h) back into the same form 
as S(h) and thus find out how g changes. Therefore, it suffices to look at the 
(dh/dt)* and (V2h) terms in the action, or equivalently at the propagator, which 
is considerably simpler to calculate. As Rudolf Peierls once said’ to the young 
Hans Bethe, “Erst kommt das Denken, dann der Integral.” (Roughly, “First think, 
then do the integral.”) We will not do the computation here. Suffice it to note that 
g has the high school dimension of (length)? ~*) (see Exercise VI1.8.5). Thus, 
according to the preceding discussion we should have 


LSB = 12— Dg teng?+-- (18) 
dL 

A detailed calculation is needed to determine the coefficient cp, which obvi- 
ously depends on the dimension of space D since the Feynman integrals depend on 
D. The equation tells us how g, an effective measure of nonlinearity in the phys- 
ics of surface growth, changes when we change the length scale L. For the record, 
Cp = [S(D)/4(22)?](2D — 3)/D, with S(D) the D-dimensional solid angle. The 

interesting factor is of course (2D — 3), changing sign between” D = 1 and 2. 


Localization 


As I said earlier, renormalization group flow has literally become part of the lan- 
guage of condensed matter and high energy physics. Let me give you another 
example of the power of the renormalization group. Go back to Anderson local- 
ization (Chapter V1.7), which so astonished the community at the time. People 
were surprised that the localization behavior depends so drastically on the dimen- 
sion of space D, and perhaps even more so, that for D = 2 all states are localized no 
matter how weak the strength of the disorder. Our usual physical intuition would 
say that there is a critical strength. As we will now see, both features are quite 
naturally accounted for in the renormalization group language. Already, you see 
in (18) that D enters in an essential way. 

I now after you a heuristic but beautiful (at least to me) argument given by 
Abrahams, Anderson, Licciardello, and Ramakrishnan, who as a result became 
known to the condensed matter community as the “Gang of Four.” First, you have 
to understand the difference between conductivity o and conductance G in solid 


4 John Wheeler gave me similar advice when I was a student: “Never calculate without 
first knowing the answer.” 

> Incidentally, the theory is exactly solvable for D = 1 (with methods not discussed in 
this book). 
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state physics lingo. Conductivity® is defined by J =cE, where J measures the 
number of electrons passing through a unit area per unit time. Conductance G is 
the inverse of resistance (the mnemonic: the two words rhyme). Resistance R is 
the property of a lump of material and defined in high school physics by V = 7 R, 
where the current J measures the number of electrons passing by per unit time. 
To relate o and G, consider a lump of material, taken to be a cube of size L, 
with a voltage drop V across it. Then J = JL? =a EL? =a(V/L)L*? =a LV 
and thus’ G(L) = 1/R =1/V =aL. Next, let us go to two dimensions. Consider 
a thin sheet of material of length and width L and thickness a < L. (We are doing 
real high school physics, not talking about some sophisticated field theorist’s idea 
of two dimensional space!) Again, apply a voltage drop V over the length L: 
[= J(aL)=cEalL =a(V/L)aL =oVa and so G(L) =1/R=1/V =caa.I 
will let you go on to one dimension: Consider a wire of length L and width and 
thickness a. In this way, we obtain G(L) « LP-?2. Incidentally, condensed matter 
physicists customarily define a dimensionless conductance g(L) =h#G(L) Je?. 

We also know the behavior of g(L) when g(L) is small or, in other words, when 
the material is an insulator for which we expect g(L) ~ ce—“/§ | with — some length 
characteristic of the material and determined by the microscopic physics. Thus, 
for g(L) small, L(dg/dL) = —(L/&)g(L) = g(L)[log g(L) — log c], where the 
constant log c is negligible in the regime under consideration. 

Putting things together, we obtain 


P(g 


ide = {inet forsma (19) 


log g+.--- for small g 


First, a trivial note: in different subjects, people define B(g) differently (without 
affecting the physics of course). In localization theory, 8(g) is traditionally defined 
as d log g/d log L as indicated here. From the plot of 8(g) in Figure VI.6.2 you 
see that for D =2 (and D = 1) the conductance g(L) always flows toward 0 as 
we go to long distances (macroscopic measurements on macroscopic materials) 
regardless of where we start. In contrast, for D = 3, if gp the initial value of g is 
greater than a critical g, then g(L) flows to infinity (presumably cut off by physics 
we haven’t included) and the material is a metal, while if g9 < g., the material 


© Over the years I have asked anumber of high energy theorists how is it possible to obtain 
J =o, which manifestly violates time reversal invariance, if the microscopic physics of 
an electron scattering on an impurity atom perfectly well respects time reversal invariance. 
Very few knew the answer. The resolution of this apparent paradox is in the order of limits! 
Condensed matter theorists calculate a frequency and wave vector dependent conductivty 
a(@, i) and then take the limit w, k— 0 and e/a —> 0. Before the limit is taken, time 
reversal invariance holds. The time it takes the particle to find out that it is in a box of size 
of order 1/k is of order 1/(Dk) (with D the diffusion constant). The physics is that this 
time has to be much longer than the observation time ~ 1/w. 

7Sam Treiman told me that when he joined the U.S. Army as a radio operator he was 
taught that there were three forms of Ohm’s law: V =/R, J = V/R, and R= V/J. In the 
second equality here we use the fourth form. 
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Figure VI.8.2 


is an insulator. Incidentally, condensed matter theorists often speak of a critical 
dimension D,. at which the long distance behavior of a system changes drastically; 
in this case D, = 2. 


Effective description 


In a sense, the renormalization group goes back to a basic notion of physics, 
that the effective description can and should change as we move from one length 
scale to another. For example, in hydrodynamics we do not have to keep track 
of the detailed interaction among water molecules. Similarly, when we apply the 
renormalization group flow to the strong interaction, starting at high energies and 
moving toward low energies, the effective description goes from a theory of quarks 
and gluons to a theory of nucleons and mesons. In this more general picture then, 
we no longer think of flowing in a space of coupling constants, but in “the space 
of Hamiltonians” that some condensed matter physicists like to talk about. 


Exercises 


VL8.1. Show that the solution of dg/dt = —bg? + - - - is given by 


1 1 
ne _ ) 
ay a0) + 8rbt + (20) 


where we defined a(t) = g(t)? /4x. 
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VI. Field Theory and Condensed Matter 


In our discussion of the renormalization group, in Av* theory or in QED, for the 
sake of simplicity we assumed that the mass m of the particle is much smaller than 
jz and thus set m equal to zero, But nothing in the renormalization group idea tells 
us that we can’t flow to a mass scale below m. Indeed, in particle physics many 
orders of magnitude separate the top quark mass m, from the up quark mass m,,. 
We might want to study how the strong interaction coupling flows from some mass 
scale far above m, down to some mass scale jz below m, but still large compared to 
m,,. AS a crude approximation, people often set any mass m below yz equal to zero 
and any m above yz to infinity (i.e., not contributing to the renormalization group 
flow). In reality, as 42 approaches m from above the particle starts to contribute 
less and drops out as jz becomes much less than m. Taking either the Ag+ theory 
or QED study this so-called threshold effect. 


Show that (10) is invariant under the so-called Galilean transformation 


h(k, 1) > h'G, ON =h (E+ gut, tc) +e-¥+ 5 ut (21) 


Show that because of this symmetry only two parameters w and § appear in (17). 


In S(A) only derivatives of the field h can appear and not the field itself. (Since the 
transformation A(x, t) > A(x, t) + ¢ with c a constant corresponds to a trivial 
shift of where we measure the surface height from, the physics must be invariant 
under this transformation.) Terms involving only one power of A cannot appear 
since they are all total divergences. Thus, 5(A) must start with terms quadratic 
in h. Verify that the $(h) given in (17) is indeed the most general. A term 
proportional to (VA)? is also allowed by symmetries and is in fact generated. 
However, such a term can be eliminated by transforming to a moving coordinate 
frame h > A+ ct. 


Show that g has the high school dimension of (length)? ~2). [Hint: The form 
of S(h) implies that ¢ has the dimension of length squared and so A has the 
dimension (length)? 2” .] Comparing the terms V2h and g(Vh)? we determine 
the dimension of g. 


.6. Calculate the h propagator to one loop order. Extract the coefficients of the w? and 


k4 terms ina low frequency and wave number expansion of the inverse propagator 
and determine @ and £. 


Study the renormalization group flow of ¢ for D = 1, 2, 3. 


PART VII 
GRAND UNIFICATION 


Chapter VII.1 


Quantizing Yang-Mills Theory and 
Lattice Gauge Theory 


One reason that Yang-Mills theory was not immediately taken up by physicists is 
that people did not know how to calculate with it. At the very least, we should 
be able to write down the Feynman rules and calculate perturbatively. Feynman 
himself took up the challenge and concluded, after looking at various diagrams, 
that extra fields with ghostlike properties had to be introduced for the theory to be 
consistent. Nowadays we know how to derive this result more systematically. 

Consider pure Yang-Mills theory—it will be easy to add matter fields later. 
Follow what we have learned. Split the Lagrangian & = £9 + £, as usual into two 
pieces (we also choose to scale A > gA): 


Ly = —7 (0, A5 — ,A2) (1) 
and 
Ly _ —59(0,A7 _ OA) fare APH AY? 4 gO fe FP AN AC ASH AY (2) 


Then invert the differential operator in the quadratic piece (1) to obtain the prop- 
agator. This part looks the same as the corresponding procedure for quantum 
electrodynamics, except for the occurrence of the index a. Just as in electrody- 
namics, the inverse does not exist and we have to fix a gauge. 

I built up the elaborate Faddeev-Popov method to quantize quantum electrody- 
namics and as I noted, it was a bit of overkill in that context. But here comes the 
payoff: We can now turn the crank. Recail from Chapter III.4 that the Faddeev- 
Popov method would give us 


z= | DAe'S) A(A)SLf (A)] (3) 


with A(A) ={f Dgd[f(A,)]} | and S(A) = f d*x& the Yang-Mills action. (As 
in Chapter III.4, A, = gAg”' —i(8g)g | denotes the gauge transform of A. Here 
g = g(x) denotes the group element that defines the gauge transformation at x and 
is obviously not to be confused with the coupling constant.) 

Since A(A) appears in (3) multiplied by 5[f(A)J], in the integral over g we 
expect, for a reasonable choice of f(A), only infinitesimal g to be relevant. 


353 


354 VI. Grand Unification 


Let us choose f(A) =@A —o. Under an infinitesimal transformation, A> 
Aa — £76 AS + 3,07 and thus 


A(A) ={ | DO3[9A* — 0 — 84 (f70P AS — 3,0°)} (4) 
“=f / Dos[a" (F720 AS — 3,6%)]}*. 


where the “effectively equal sign” follows since A(A) is to be multiplied later by 


dL f(A) 
Let us write formally 


am sereobas — 3,09) = f dtyK%(x, »)8"O) (5) 


thus defining the operator K(x, y) = d4( fore as — 0,,5%)3 4) (x — y). Note 
that in contrast to electromagnetism here K depends on the gauge potential. The 
elementary result { d@5(K@) = 1/K for 6 and K real numbers can be generalized 
to f d@5(K@) = 1/ det K for@ areal vector and K anonsingular matrix. Regarding 
K(x, y) as a matrix, we obtain A(A) = det K, but we know from Chapter III.4 
how to represent the determinant as a functional integral over Grassmann variables: 
Write A(A) = [ DeDete'Ssros(c" +2), with 


Senost(Ct, €) = | d'xd* yc'(x)K® (x, y)ep(y) 
= / d*x{dch (x)8cy(x) — aMch (x) f2°AS (x)ep(x)] 
= / d*xdch(x) Deg (x) (6) 


and with D the covariant derivative for the adjoint representation, to which the 
fields c, and ci belong just like Av, The fields c, and cl are known as ghost fields 
because they violate the spin-statistics connection: Though scalar, they are treated 
as anticommuting. This “violation” is acceptable because they are not associated 
with physical particles and are introduced merely to represent A(A) ina convenient 
form. 

This takes care of the A(A) factor in (3). As for the d[f (A)] factor, we use the 
same trick as in Chapter III.4 and integrate Z over 0“(x) with a Gaussian weight 
e G/28) f d*x0%(x) 5 that 5[ f (A)] gets replaced by e “/25? fatx@ary 

Putting it all together, we obtain 


i ~(i 4 24; t 
Zz, =| DADcDcieS™ (8/2) fa XGA)" +i Sghose(c" ,€) (7) 
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with € a gauge parameter. Comparing with the corresponding expression for an 
abelian gauge theory in Chapter III.4, we see that in nonabelian gauge theories we 
have a ghost action Sphos in addition to the Yang-Mills action. Thus, Lo and L, 
are changed to 


1 1 
by = — 7 pay — WAL — Ze OHALY’ + 8c, 8CQ(2) (8) 


and 
Ly — —38(3,A4 _ a,A0) Fare APH AY + 4B f 0" fee AY ACA AS 
— acl exe fA (x)ep(x) 9) 


We can now read off the propagators for the gauge boson and for the ghost 
field immediately from (8). In particular, we see that except for the group index 
a the terms quadratic in the gauge potential are exactly the same as the terms 
quadratic in the electromagnetic gauge potential in (III.4.8). Thus, the gauge boson 
propagator is 


C) E ~(1—8) 


k pKa 
k2 k2 


Sab (10) 


Compare with (III.4.9). From the term dct (x)dcy(x) in (8) we find the ghost 
propagator to be (i/k*)8qp- 

From £; we see that there is a cubic and a quartic interaction between the 
gauge bosons, and an interaction between the gauge boson and the ghost field, 
as illustrated in Figure VII.1.1. The cubic and the quartic couplings can be easily 
read off as 


of Pg uy (Ri — Koda + Balke — Kady + Bap lks — kyl (11) 
and 
ig? [ fee fo (By, 8vp — BupBvrd + fo FB wr 8up — Suv8pr) 
+ fo FP" (2 Bin — Sup8vr] (12) 


respectively. The coupling to the ghost field is 


ef?” pH (13) 


Obviously, we can exploit various permutation symmetries in writing these down. 
For instance, in (12) the second term is obtained from the first by the interchange 
{c, A} < {d, po}, and the third and fourth terms are obtained from the first and 
second by the interchange {a, 4} < {c, A}. 
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Figure VIIL1.1 


Unnatural act 


In a highly symmetric theory such as Yang-Mills, perturbating is clearly an unnatu- 
ral act as it involves brutally splitting £ into two parts: a part quadratic in the fields 
and the rest. Consider, for example, an exactly soluble single particle quantum me- 
chanics problem, such as the Schrodinger equation with V (x) = 1 — (1/ cosh x)?. 
Imagine writing V(x) = 4x? -+ W(x) and treating W(x) as a perturbation on the 
harmonic oscillator. You would have a hard time reproducing the exact spectrum, 
but this is exactly how we brutalize Yang-Mills theory in the perturbative approach: 
We took the “holistic entity” tr F,,, F“” and split itup into the “harmonic oscillator” 
piece tr(d,, A, — 8,Ay)° and a “perturbation.” 

If Yang-Mills theory ever proves to be exactly soluble, the perturbative approach 
with its mangling of gauge invariance is clearly not the way to do it. 


Lattice gauge theory 


Wilson proposed a way out: Do violence to Lorentz invariance rather than to 
gauge invariance. Let us formulate Yang-Mills theory on a hypercubic lattice 
in 4-dimensional Euclidean spacetime. As the lattice spacing a > 0 we expect 
to recover 4-dimensional rotational invariance and (by a Wick rotation) Lorentz 
invariance. Wilson’s formulation, known as lattice gauge theory, is easy to under- 
stand, but the notation is a bit awkward, due to the lack of rotational invariance. 
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Figure VII.1.2 


Denote the location of the lattice sites by the vector x;. On each link, say the one 
going from x; to one of its nearest neighbors x ;, we associate an N by N unitary 
matrix U;;. Consider the square, known as a plaquette, bounded by the four corners 
X;,X;, X,, and x, (with these nearest neighbors to each other.) See Figure VII.1.2. 
For each plaquette P we associate the quantity S(P) = Re tr U;;U ;,UjqU};, con- 
structed to be invariant under the local transformation 


Ui; > V/U;V; (14) 
The symmetry is local because for each site x; we can associate an inde- 
pendent V;. 
Wilson defined Yang-Mills theory by 


z= | Md e!V/2f)) dip SP) (15) 


where the sum is taken over all the plaquettes in the lattice. The coupling strength f 
controls how wildly the unitary matrices U;,’s fluctuate. For small f, large values 
of S(P) are favored, and so the U;;’s are all approximately equal to the unit matrix 
(up to an irrelevant global transformation.) 

Without doing any arithmetic, we can argue by symmetry that in the continuum 
limit a — 0, Yang-Mills theory as we know it must emerge: The action is mani- 
festly invariant under local SU(N) transformation. To actually see this, define a 
field A,,(x) with p = 1, 2, 3, 4, permeating the 4-dimensional Euclidean space the 
lattice lives in, by 


Uj; _ ViieltAu@y, (16) 


where x = }(x;-+ x;) (namely the midpoint of the link U;; lives on) and is the 


direction connecting x; to x; (namely pax j — *;)/a is the unit vector in the jz 
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direction.) The V’s just reflect the gauge freedom in (14) and obviously do not enter 
into the plaquette action $(P) by construction. I will let you show in an exercise 
that 


U;,U 


- 2 3 
ifU je Uy; = ef" Fuvt a) (17) 


with F,,, the Yang-Mills field strength evaluated at the center of the plaquette. 
Indeed, we could have discovered the Yang-Mills field strength in this way. I hope 
that you start to see the deep geometric significance of F,,,,. Continuing the exercise 
you wiil find that the action on each plaquette comes out to be 


S(P) = Re tr elt Fut O(a") 
=Retr[l+ia’F,, — ja‘F,,F,, + O@))=t1— att FF, +:°: 
(18) 


and soup to an irrelevant additive constant we recover in (15) the Yang-Mills action 
in the continuum limit. Again, it is worth emphasizing that without going through 
any arithmetic we could have fixed the a‘ term in (18) (up to an overall constant) 
by dimensional analysis and gauge invariance.! 

The Wilson formulation is beautiful in that none of the hand-wringing over 
gauge fixing, Faddeev-Popov determinant, ghost fields, and so forth is necessary 
for (15) to make sense. Recalling Chapter V.3 you see that (15) defines a statistical 
mechanics problem like any other. Instead of integrating over some spin variables 
say, We integrate over the group SU(N) for each link. Most importantly, the lattice 
gauge formulation opens up the possibility of computing the properties of a highly 
nontrivial quantum field theory numerically. Lattice gauge theory 1s a thriving area 
of research. For a challenge, try to incorporate fermions into lattice gauge theory: 
This is a highly nontrivial and ongoing problem because fermions and spinor fields 
are naturally associated with SO (4), which does not sit well on a lattice. 


Wilson loop 


Field theorists usually deal with local observables, that is, observables defined at 
a spacetime point x, such as J“(x) or tr F,,,(x) F*"(x), but of course we can also 
deal with nonlocal observables, such as é gf, co RPA in electromagnetism, where 
the line integral is evaluated over a closed curve C’. The gauge invariant quantity 
in the exponential is equal to the electromagnetic flux going through the surface 
bounded by C. (Indeed, recall Chapter IV.4.) 

Wilson pointed out that lattice gauge theory contains a natural gauge invariant 
but nonlocal observable W(C) = tr U;;U jx. --UnmUmi, where the set of links 
connecting x; to x; to x, et cetera and eventually to x,, and back to x; traces out 


1The sign can be easily checked against the abelian case. 
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a loop called C. Referring to (16) we see that W(C), known as the Wilson loop, 
is the trace of a product of many factors of e'?4u. Thus, in the continuum limit 
a — 0, we have evidently 


W(C) =tr Pe! fo dxHAy (19) 


with C now an arbitrary curve in Euclidean spacetime. Here P denotes path 
ordering, clearly necessary since the A,,’s associated with different segments of 
C, being matrices, do not commute with each other. [Indeed, P is defined by the 
lattice definition of W(C).] 

To understand the physical meaning of the Wilson loop, Recall Chapters 1.4 
and I.5, To obtain the potential energy E between two oppositely charged lumps 
we have to compute 


. : 4 : 
Jim : | DAdeSMaweu(Alti f dtxAyJ* _ giET 
> 0o 


For two lumps held at a distance R apart we plug in 
J# a) = {BO H)— BOLE — (R, 0, OV} 


and see that we are actually computing the expectation value 
i(f,, dx#Ay— fi. ax¥A,), ; 

{e 7" 2 ) in a fluctuating electromagnetic field, where C, and Cy 
denote two straight line segments at ¥ = (0, 0, 0) and x = (R, 0, 0), respectively. 
It is convenient to imagine bringing the two lumps together in the far future (and 
similarly in the far past). Then we deal instead with the manifestly gauge invariant 
quantity (e' $c atau where C is the rectangle shown in Figure VII.1.3. Note that 
for T large loge’ $. PAY ~ iE(R)T, which is essentially proportional to the 
perimeter length of the rectangle C. 

As we will discuss in Chapter VIL3 and as you have undoubtedly heard, the 
currently accepted theory of the strong interaction involves quarks coupled to a 
nonabelian Yang-Mills gauge potential A ,,. Thus, to determine the potential energy 
E(R) between a quark and an antiquark held fixed at a distance R from each other 
we “merely” have to compute the expectation value of the Wilson loop 


1 
WO) => f Mave V7 En Sw (cy (20) 
In lattice gauge theory we could compute log(W(C)) for C the large rectangle in 
Figure VII.1.3 numerically, and extract F(R). (We lost the i because we are living 
in Euclidean spacetime for the purpose of this discussion.) 
Quark confinement 


You have also undoubtedly heard that since free quarks have not been observed, 
quarks are generally believed to be permanently confined. In particular, it is 
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Figure VII.1.3 


believed that the potential energy between a quark and an antiquark grows linearly 
with separation E(R) ~ o R. One imagines a string tying the quark to the antiquark 
with a string tension a . If this conjecture is correct, then log(W(C)) ~ o RT should 
goas the area RT enclosed by C. Wilson calls this behavior the area law, in contrast 
to the perimeter law characteristic of familiar theories such as electromagnetism. 
To prove the area law in Yang-Mills theory is one of the outstanding challenges of 
theoretical physics. 


Exercises 


VII.1.1. Derive (17) and relate f to the coupling g in the continuum formulation of Yang- 
Mills theory. [Hint: Use the Baker-Campbell-Hausdorff formula 


ehek — pAt B+ HA. BH 4 LA BIHIB LB, Alte 


VII.1.2. Consider a lattice gauge theory in (D + 1)-dimensional space with the lattice 
spacing a in D-dimensional space and b in the extra dimension. Obtain the 
continuum D-dimensional field theory in the limit a > 0 with 6 kept fixed. 


VII.1.3. Study in (2) the alternative limit b — 0 with a kept fixed so that you obtain a 
theory on a spatial lattice but with continuous time. 


VII.1.4. Show that for lattice gauge theory the Wilson area law holds in the limit of strong 
coupling. [Hint: Expand (20) in powers of f~.] 


Chapter VII.2 


Electroweak Unification 


The scourge of massless spin 1 particles 


With the benefit of hindsight, we now know that Nature likes Yang-Mills theory. 
In the late 1960s and early 1970s, the electromagnetic and weak interactions were 
unified into an electroweak interaction, described by a nonabelian gauge theory 
based on the group SU(2) @ U(1). Somewhat later, in the early 1970s, it was 
realized that the strong interaction can be described by a nonabelian gauge theory 
based on the group SU (3). Nature literally consists of a web of interacting Yang- 
Mills fields. 

But when the theory was first proposed in 1954, it seemed to be totally incon- 
sistent with observations as they were interpreted at that time. As Yang and Mills 
themselves pointed out in their paper, the theory contains massless spin | particles, 
which were certainly not known experimentally. Thus, except for interest on the 
part of a few theorists (Schwinger, Glashow, Bludman, and others) who found the 
mathematical structure elegantly attractive and felt that nonabelian gauge theory 
must somehow be relevant for the weak interaction, the theory gradually sank into 
oblivion and was not part of the standard graduate curriculum in particle physics 
in the 1960s. 

Again with the benefit of hindsight, it would seem that there are only two logical 
solutions to the difficulty that experimentalists do not see any massless spin 1 
particles except for the photon: (1) the Yang-Mills particles somehow acquire mass, 
or (2) the Yang-Mills particles are in fact massless but are somehow not observed. 
We now know that the first possibility was realized in the electroweak interaction 
and the second in the strong interaction. 


Constructing the electroweak theory 


We now discuss electroweak unification. It is perhaps pedagogically clearest to 
motivate how we would go about constructing such a theory. As I have said before, 
this is not a textbook on particle physics and I necessarily will have to keep the 
discussion of particle physics to the bare minimum. I gave you a brief introduction 
to the structure of the weak interaction in Chapter IV.2. The other salient fact is that 
weak interaction violates parity, as mentioned in Chapter II.1. In particular, the left 
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handed electron field e, and the nght handed electron field ep, which transform 
into each other under parity, enter into the weak interaction quite differently. 

Let us start with the weak decay of the muon, 47 — e~ + +’, with v and 
v’ the electron neutrino and muon neutrino, respectively. The relevant term in the 
Lagrangian is Vv) y“42e7¥,¥z, with the left hand electron field e;, the electron 
neutrino field (which is left handed) v;, and so forth. The field jz; annihilates a 
muon, the field @, creates an electron, and so on. (Henceforth, we will suppress 
the word field.) As you probably know, the elementary constituents of matter form 
three families, with the first family consisting of v, e, and the up wu and down d 
quarks, the second of v’, js, and the charm c and strange s quarks, and so on. For 
our purposes here, we will restrict our attention to the first family. Thus, we start 
with vp ye, er y,.YL- 

As I remarked in Chapter III.2, a Fermi interaction of this type can be gen- 
erated by the exchange of an intermediate vector boson Wt with the coupling 
Widpyez +Weryve- 

The idea is then to consider an SU (2) gauge theory with a triplet of gauge bosons 
denoted by W’, witha = 1, 2, 3, Put v, and e; into the doublet representation and 
the right handed electron field ep into a singlet representation, thus 


wea(?). er (1) 


(The notation is such that the upper component of yr, is vz, and the lower compo- 
nent is €;.) 

The fields v; and e;, but not ep, listen to the gauge bosons Wwe Indeed, 
according to ([V.5.21) the Lagrangian contains 


Webi hy = (We bat yy, thc.) + Wide yy, 


where wie 2 wl_j iW? and so forth. We recognize t!*!? = 7 1 | it? as the rais- 
ing operator and the first two terms as (W'"'2, ye, +h.c.), precisely what we 
want. By design, the exchange of we generates the desired term v, y“e; 7 y,,Vz. 


We need more room 


We would hope that the boson W? we were forced to introduce would turn out 
to be the photon so that electromagnetism is included. But alas, W* couples to 
the current Ppt y“w, = py"4v, — €,¥,eL), not the electromagnetic current 
—(€1Y,e_ + €r¥,er). Oops! 

Another problem lurks. To generate a mass term for the electron, we need 
a doublet Higgs field g = (% ) in order to construct the SU (2) invariant term 
f ger in the Lagrangian so > that when @ acquires the vacuum expectation value 
(°) we will have 
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f¥L~er > FO, Oz, (‘) Cr = freer (2) 


But none of the SU (2) transformations leaves ( 0 )invariant: The vacuum expecta- 
tion value of g spontaneously breaks the entire SU (2) symmetry, leaving all three 
W bosons massive. There is no room for the photon in this failed theory. Aagh! 

We need more room. Remarkably, we can avoid both the oops and the aagh 
by extending the gauge symmetry to SU(2) @ U(1). Denoting the generator of 
U (1) by 4¥ (called the hypercharge) and the associated gauge potential by B 
[and their counterparts T° and We for SU(2)] we have the covariant derivative 
D, =4, — ig Wi T° —ig By, t With four gauge bosons, we dare to hope that one 
of 1 them ‘might turn out to be | the photon. 

The gauge potentials are normalized by the corresponding kinetic energy terms, 
£=—1(B,,) — 4(W2,)? + -- - with the abelian B,,, = 4,,B, — 4,B,, and non- 
abelian field strength we, =0,W) -9,We + eave we Wt. The generators T? are 
of course normalized by the commutation relations that define SU (2). In contrast, 
there is no commutation relation in the abelian algebra U(1) to fix the normaliza- 
tion of the generator $Y. Until this is fixed, the normalization of the U(1) gauge 
coupling g’ is not fixed. 

How do we fix the normalization of the generator $Y? By construction, we 
want spontaneous symmetry breaking to leave a linear combination of 7; and 4Y 
invariant, to be identified as the generator the massless photon couples to, namely 
the charge operator Q. Thus, we write 


Once we know 73 and }Y of any field, this equation tells us its charge. For example, 
QO(v,) = 4+ 4Y(v,) and Q(e,) = —4 + 4Y(e,). In particular, we see that the 
coefficient of 7; in (3) must be 1 since “the charges of v, and e, differ by 1. The 
relation (3) fixes the normalization of 5Y. 


Determining the hypercharge 


The next step is to determine the hypercharge of various multiplets in the theory, 
which in turn determines how B,, couples to these multiplets. Consider yy; . For 
e, to have charge —1, the doublet yw, must have 4 Y = —4. In contrast, the field 
€r has $¥ = —1 since 7; =O on ep. 

Given the hypercharge of yw; and ep we see that the invariance of the term 
fw Ger under SU(2) @ U(1) forces the Higgs field g to have 3¥ =+4. Thus, 
according to (3) the upper component of g has electric charge Q = +3 +4i=+1 
and the lower component Q = —} + 4 =0. Thus, we write g = (¢ ay Recall that 
el has the vacuum expectation value (° ). The fact that the electrically neutral field 


y® acquires a vacuum expectation value but the charged field y+ does not provide 
a a consistency check. 
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The theory works itself out 


Now that the couplings of the gauge bosons to the various fields, in particular, the 
Higgs field, are determined, we can easily work out the mass spectrum of the gauge 
bosons, as indeed, let me remind you, you have already done in Exercise IV.6.3! 

Upon spontaneous symmetry breaking yg —> (1/ /2)/( °) (the normalization is 
conventional): We simply plug in 


2,2 2 
_ tiene SU wtw-# UL 3 ofp \2 
& = (Dye) (D' 9) > WW + g (eM, — 8 By) (4) 


I trust that this is what you got! Thus, the linear combination g wi — g’B,, becomes 
massive while the orthogonal combination remains massless and is identified with 
the photon. It is clearly convenient to define the angle @ by tan 6 = g’/g. Then, 


3 : 
Z, = cosOW) — sind B,, (5) 
describes a massive gauge boson known as the Z boson, while the electromagnetic 
potential is given by A,, = sin 6 Wi + cos 6 B,,. Combine (4) and (5) and verify that 


the mass squared of the Z boson is Mz = v*(g* + g”)/4, and thus by elementary 
tigonometry obtain the relation 


M w= M z COS 74 (6) 
The exchange of the W boson generates the Fermi weak interaction 
2 


yg? 
2M2, 


- 4G_ 4 - 
Vey" Crery,YL = “yy bY €reLr¥pYL 


where the second equality merely gives the historical definition of the Fermi 
coupling G. Thus, 


= = (7) 


Next, we write the relevant piece of the covariant derivative 
W373 + ¢'B = 6Z, +sinOA,)T° + g/(—sinOZ 94,2 
gWiio +s py = B(cos wt sindA, g'(— sin OZ, + cosOA,, 5 


in terms of the physically observed Z and A. The coefficient of A,, works out to 
be g sin OT? + g’ cos@(Y/2) = g sin6(T* + Y/2); the fact that the combination 
Q =T° + Y/2 emerges provides a nice check on the formalism. Furthermore, we 
obtain 


e=gsin@g (8) 


Meanwhile, it is convenient to write g cos 9T? — g’ sin 6(Y /2), the coefficient 
of Z,, in the covariant derivative, in terms of the physically familiar electric charge 
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Q rather than the theoretical hypercharge Y: Thus, 
gcosOT? — g' sing(Q — T?) = *—(T? — sin? 6Q) 
cos @ 


In other words, we have determined the coupling of the Z boson to an arbitrary 
fermion field W in the theory: 


L=—8_Z, Wy"(T3 — sin? 0 Q)v (9) 
cos @ 


For example, using (9) we can immediately write the coupling of Z to leptons: 


L<= 


seg cul Ory" YL —é,y"“e,) + sin’ 6 éy“e] (10) 


Including quarks 


How to include the hadrons is now almost self evident. Given that only left handed 
fields participate in the weak interaction, we put the quarks of the first generation 
into SU (2) ® U(1) multiples as follows: 


a=(‘e) - us, de (11) 


where a = 1, 2, 3 denotes the color index, which I will discuss in the next chapter. 
The right handed quarks u& and d> are put into singlets so that they do not hear the 
weak bosons W“. Recall that the up quark u and the down quark d have electric 
charges 2 and —4 respectively. Referring to (3) we see 4Y = 3, 3, and —3 for 
Gr> UR, and dé, respectively. From (9) we can immediately read off the coupling 
of the Z boson to the quarks: 


& Ver 5 “2 
L= cos 0 ulz (apy uy —d,y"dz) — sm Ost] (12) 
Finally, [leave it to you to verify that of the four degrees of freedom contained 
in g (since y* and y” are complex) three are eaten by the W and Z bosons, leaving 
one physical degree of freedom H corresponding to the elusive Higgs particle that 
experimenters are still searching for as of this writing. 


The neutral current 
By virtue of its elegantly economical gauge group structure, this SU(2) ® U(1) 
electroweak theory of Glashow, Salam, and Weinberg ushered in the last great 
predictive era of theoretical particle physics. Writing (10) and (12) as 


_ _8 He 
h= cos 9 oH Steptons* Fouarks) 
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and using (6) we see that Z boson exchange generates a hitherto unknown neutral 
current interaction 
8 


) M2 (Ateptons + I quarks) "CS leptons + J quarks) 
Ww 


cceutral current = 


between leptons and quarks. By studying various processes described by 
oneutral current WE Can determine the weak angle @. Once @ is determined, we can 
predict g from (8). Once g is determined, we can predict My from (7). Once My 
is determined, we can predict M, from (6). 


Concluding remarks 


As I mentioned, there are three families of leptons and quarks in Nature, consisting 
of (v,,€, u, d), (Vy, #, ¢, 5), and (v,, T, t, b). The appearance of this repetitive 
family structure, about which the SU (2) @ U()) theory has nothing to say, rep- 
resents one of the great unsolved puzzles of particle physics. The three families, 
with the appropriate rotation angles between them, are simply incorporated into 
the theory by repeating what we wrote above. 

A more logical approach than the one given here would be to start with an 
SU (2) ® U()) theory with a doublet Higgs field with some hypercharge, and to 
say, “Behold, upon spontaneous symmetry breaking, one linear combination of 
generators remains unbroken with a corresponding massless gauge field.” I think 
that our quasi-historical approach is clearer. 

As [have mentioned on several occasions, Fermi’s theory of the weak interac- 
tion is nonrenormalizable. In 1999, ’t Hooft and Veltman were awarded the Nobel 
Prize for showing that the SU (2) @ U(1) electroweak theory is renormalizable, 
thus paving the way for the triumph of nonabelian gauge theories in describing the 
strong, electromagnetic, and weak interactions. I cannot go into the details of their 
proof here, but I would like to mention that the key 1s to start with the nonabelian 
analog of the unitary gauge (recall Chapter IV.7) and proceed to the R; gauge. At 
large momenta, the massive gauge boson propagators go as ~ (k,,k, /k*) in the 
unitary gauge, but as ~ (1/k*) in the R; gauge. The theory is then renormalizable 
by power counting. 


Exercises 


Vi.2.1. Unfortunately, the mass of the elusive Higgs particle H depends on the param- 
eters in the double well potential V = —u?o'y + A(y*¢)? responsible for the 
spontaneous symmetry breaking. Assuming that H is massive enough to de- 
cay into Wt + W~ and Z + Z, determine the rates for H to decay into various 
modes. 
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VII.2.2. Show that it is possible to stay with the SU (2) gauge group and to identify W? as 
the photon A, but at the cost of inventing some experimentally unobserved lepton 
fields. This theory does not describe our world: For one thing, it is essentially 
impossible to incorporate the quarks. Show this! [Hint: We have to put the leptons 
into a triplet of SU (2) instead of a doublet.] 


Chapter VII.3 


Quantum Chromodynamics 


Quarks 


Quarks come in six flavors, known as up, down, strange, charm, bottom, and 
top, denoted by u, d,s,c, b, and t. The proton, for example, is made of two 
up quarks and a down quark ~ (uud), while the neutral pion corresponds to 
~ (uit — dd)/V/2. Please consult any text on particle physics for details. 

By the late 1960s the notion of quarks was gaining wide acceptance, but two 
separate lines of evidence indicated that a crucial element was missing. In studying 
how hadrons are made of quarks, people realized that the wave function of the 
quarks in a nucleon does not come out to be antisymmetric under the interchange 
of any pair of quarks, as required by the Pauli exclusion principle. At around the 
same time, it was realized that in the ideal world we used to derive the Goldberger- 
Treiman relation and in which the pion is massless we can calculate the decay rate 
for the process 7° + y + y, as mentioned in Chapter IV.7. Puzzlingly enough, 
the calculated rate came out smaller than the observed rate by a factor of 9 = 3”. 

Both puzzles could be resolved in one stroke by having quarks carry a hitherto 
unknown internal degree of freedom that Gell-Mann called color. For any specified 
flavor, a quark comes in one of three colors. Thus, the up quark can be red, blue, 
or yellow. In a nucleon, the wave function of the three quarks will then contain a 
factor referring to color, besides the factors referring to orbital motion, spin, and 
so on. We merely have to make the color part of the wave function antisymmetric; 
in fact, we simply take it to be e%PY where a, B, and y denote the colors carried 
by the three quarks. With quarks in three colors, we have to multiply the amplitude 
for 2° decay by a factor of 3, thus neatly resolving the discrepancy between theory 
and experiment. 


Asymptotic freedom 


As I mentioned in Chapter VI.6, the essential clue came from studying deep 
inelastic scattering of electrons off nucleons. Experimentalists made the intriguing 
discovery that when hit hard the quarks in the nucleons act as if they hardly interact 
with each other, in other words, as if they are free. On the other hand, since quarks 
are never seen as isolated entities, they appear to be tightly bound to each other 
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within the nucleon. As I have explained, this puzzling and apparently contradictory 
behavior of the quarks can be understood if the strong interaction coupling flows 
to zero in the large momentum (ultraviolet) limit and to infinity or at least to some 
large value in the small momentum (infrared) limit. A number of theorists proposed 
searching for theories whose couplings would flow to zero in the ultraviolet limit, 
now known as asymptotic free theories. Eventually, Gross, Wilczek, and Politzer 
discovered that Yang-Mills theory is asymptotically free. 

This result dovetails perfectly with the realization that quarks carry color. The 
Nonabelian gauge transformation would take a quark of one color into a quark of 
another color. Thus, to write down the theory of the strong interaction we simply 
take the result of Exercise IV.5.6, 


L =-yaFar + qliy*D,, — mq (1) 
with the covariant derivative D,, = 3,, — i A,,. The gauge group is SU (3) with the 
quark field g in the fundamental representation. In other words, the gauge fields 
Ay= = Al, T?, where T? (a = 1,..., 8) are traceless hermitean 3 by 3 matrices. 
Explicitly, (A,q)* = A4 a(TA)%qP, where a, B = 1, 2, 3. The theory is known as 
quantum chromodynamics, or “OCD for short, and the non-abelian gauge bosons 
are known as gluons. To incorporate flavor, we simply write yi jel gjGy"D,, - 
m ;)q; for the second term in (1), where the index j goes over the f flavors. Note 
that quarks of different flavors have different masses. 


Infrared slavery 


The flip side of asymptotic freedom is infrared slavery. We cannot follow the renor- 
malization group flow all the way down to the low momentum scale characteristic 
of the quarks bound inside hadrons since the coupling g becomes ever stronger 
and our perturbative calculation of 8(g) is no longer adequate. Nevertheless, it is 
plausible although never proven that g goes to infinity and that the gluons keep the 
quarks and themselves in permanent confinement. The Wilson loop introduced in 
Chapter VII.1 provides the order parameter for confinement. 

In elementary physics forces decrease with the separation between interacting 
objects, so permanent confinement is a rather bizarre concept. Are there any 
other instances of permanent confinement? Consider a magnetic monopole in a 
superconductor. We get to combine what we learned in Chapters IV.4 and V.4 (and 
even VI.2)! A quantized amount of magnetic flux comes out of the monopole, but 
according to the Meissner effect a superconductor expels magnetic flux. Thus, a 
single magnetic monopole cannot live inside a superconductor. 

Now consider an antimonopole a distance R away (Figure VII.3.1). The mag- 
netic flux coming out of the monopole can go into the antimonopole, forming a tube 
connecting the monopole and the antimonopole and obliging the superconductor 
to give up being a superconductor in the region of the flux tube. In the language of 
Chapter V.4, it is no longer energetically favorable for the field or order parameter 
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Figure VII.3.1 


g to be constant everywhere; instead it vanishes in the region of the flux tube. The 
energy cost of this arrangement evidently grows as R (consistent with Wilson’s 
area law). 

In other words, an experimentalist living inside a superconductor would find that 
it costs more and more energy to pull a monopole and an antimonopole apart. This 
confinement of monopoles inside a superconductor is often taken to be a model of 
the yet-to-be-proven confinement of quarks. Invoking electromagnetic duality we 
can imagine a magnetic superconductor in contrast to the usual electric supercon- 
ductor. Inside a magnetic superconductor, electric charges would be permanently 
confined. Our universe may be likened to a color magnetic superconductor in which 
quarks (the analog of electric charges) are confined. 

On distance scales large compared to the radius of the color flux tube connecting 
a quark to an antiquark, the tube can be thought of as a string. Historically, that 
was how string theory originated. The challenge, boys and girls, is to prove that 
the ground state or vacuum of (1) is a color magnetic superconductor. 


Symmetries of the strong interaction 


Now that we have a theory of the strong interaction, we can understand the origin 
of the symmetries of the strong interaction, namely the isospin symmetry of 
Heisenberg and the chiral symmetry that when spontaneously broken leads to the 
appearance of the pion as a Nambu-Goldstone boson (as discussed in Chapters 
IV.2 and VI.4). 

Consider a world with two flavors, which is all that is relevant for a discussion 
of the pion. Introduce the notation u = q), d = qo, and q = (4) so that we can 
write the Lagrangian as 


i - 
L=——5 Fi LF + aliy"D,, — m)q 


4g uv 


with 
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where m,, and m, are the masses of the up and down quarks, respectively. If m, = 
m,, the Lagrangian is invariant under g — eg, corresponding to Heisenberg’s 
isospin symmetry. 

In the limit in which m, and m, vanish, the Lagrangian is invariant under 
q — e'? 5g, known as the chiral SU(2) symmetry, chiral because the right 
handed quarks gp and the left handed quarks q, transform differently. To the 
extent that m,, and m, are both much smaller than the energy scale of the strong 
interaction, chiral SU (2) is an approximate symmetry. 

The pion is the Nambu-Goldstone boson associated with the spontaneous break- 
ing of the chiral SU (2). Indeed, this is an example of dynamical symmetry break- 
ing since there is no elementary scalar field around to acquire a vacuum expec- 
tation. Instead, the strong interaction dynamics is supposed to drive the compos- 
ite scalar fields Zu and dd to “condense into the vacuum” so that (0| au |0) = 
(0| dd |0) become nonvanishing, where the equality between the two vacuum ex- 
pectation values ensures that Heisenberg’s isospin is not spontaneously broken, 
an experimental fact since there are no corresponding Nambu-Goldstone bosons. 
In terms of the doublet field g, the QCD vacuum is supposed to be such that 
(0| Gq |0) #0 while (0| zg |0) =0. 


Renormalization group flow 


The renormalization group flow of the QCD coupling is governed by 


48 _ 9) - Up Gg) 
dt B= 3 OTE 0) 


with the all-crucial minus sign. Here 
T)(G)57” _— fed peed (3) 


I will not go through the calculation of B(g) here, but having mastered Chapters 
VL8 and VII.1 you should feel that you can do it if you want to. At the very least, 
you should understand the factor g? and T>(G) by drawing the relevant Feynman 
diagrams. 

When fermions are included, 
11 4 g° 
=F = Be) =|-FH@ + §nh)| Fs (4) 


where 


T)(F)5° = te[T°(F)T?(F)] (5) 


' For a detailed calculation, see, e. g., 5S. Weinberg, The Quantum Theory of Fields, Vol. 
2, sec. 18.7. 
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Figure VII.3.2 


I do expect you to derive (4) given (2). For SU(N) T>(F) = 3 for each fermion in 
the fundamental representation. Note that asymptotic freedom is lost when there 
are too many fermions. 

You already solved an equation like (4) in Exercise VI.8.1. Let us define, in 
analogy to quantum electrodynamics, a; (42) = g()”/42, the strong coupling at 
the momentum scale jz. From (4) we obtain” 


Os (w) 6) 


a&3(Q) = — 
° 1+ (1/4m)(11 — $n p)ars(2) log(Q?/p?) 


showing explicitly that a;(Q) — 0 logarithmically as Q > oo. 


Electron-positron annihilation 


I have space to show you only one physical application. Experimentalists have 
measured the cross section o of e*e~ annihilation into hadrons as a function of 
the total center-of-mass energy E. The amplitude is shown in Figure VII.3.2. To 
calculate the cross section in terms of the amplitude, we have to go through what 
some people call “boring kinematics,” such as normalizing everything correctly, 
dividing by the flux of the two beams, and so forth. For the good of your soul, you 
should certainly go through this type of calculation at least once. Believe me, I did 
it more times than I care to remember. But happily, as I will now show you, we 
can avoid most of this grunge labor. First, consider the ratio 


? For the accumulated experimental evidence on a.(Q), see Figure 14,3 in F. Wilczek, 
in: V. Fitch et al., eds., Critical Problems in Physics, p. 281. 
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Figure VIL3.3 


R(E) = o(ete~ —> hadrons) 

o(ete- > pty) 

The kinematic stuff cancels out. In Figure VII.3.2 the half of the diagram involving 
the electron positron lines and the photon propagator also appears in the Feynman 
diagram e+e™ — y+” (Figure VII.3.3) and so cancels out in R(E). The blob in 
Figure VII.3.2, which hides all the complexity of the strong interaction, is given 
by (0| J#(0)|h), where J” is the electromagnetic current and the state |h) can 
contain any number of hadrons. To obtain the cross section we have to square the 
amplitude, include a 5-function for momentum conservation, and sum over all |/), 
thus arriving at 


> 21)484( py — Pet — Pe-) (Ol J” (0) |h)(A| J”) [0) (7) 
h 


[with q = p,+ + p.- =(E, 0)]. This quantity can be written as 


J d'xe!?* (0| J#(x) J" (0) |0) = J d‘xe!@ (0|[J#(x), J”(0)]|0) 


= 2 Im(i J d’xe!@ (0| TJ#(x) J”) [0)) 


(The first equality follows from E > 0 and the second from Exercise 1.8.4.) To 
determine this quantity, we would have to calculate an infinite number of Feynman 
diagrams involving lots of quarks and gluons. A typical diagram is shown in Figure 
VII.3.4. Completely hopeless! 

This is where asymptotic freedom rides to the rescue! From Chapter VIL.7 you 
learned that for a process at energy FE the appropriate coupling strength to use is 
g(E). But as we crank up E, g(E) gets smaller and smaller. Thus diagrams such 
as Figure VII.3.4 involving many powers of g(£) all fall away, leaving us with the 
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Figure V11.3.4 


diagrams with no power of g(£) (Fig. VII.3.5a) and two powers of g(E) (Figs. 
VII.3.5b,c,d). No calculation is necessary to obtain the leading term in R(E), since 
the diagram in Figure VII.3.5a is the same one that enters into ete™ — uty: 
We merely replace the quark propagator by the muon propagator (quark and muon 
masses are negligible compared to E). Athigh energy, the quarks are free and R(E) 
merely counts the square of the charge Q,, of the various quarks contributing at 
that energy: we predict 


RE) —> 3); (8) 


The factor of 3 accounts for color. 


Y 
Y 
(c) (d) 


{a) (b) 


Figure VII.3.5 
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Not only does QCD turns itself off at high energies, it tells us how fast it is 
turning itself off. Thus, we can determine how the limit in (8) is approached: 


2 
R(E) = (3) O7) (1+ C—~————___ + - ° 
(E) ( 22) ( (11— 4n,) log(E/1) ° 


I will leave it to you to calculate C. 


Dreams of exact solubility 


An analytic solution of quantum chromodynamics is something of a “Holy Grail” 
for field theorists (a grail that now carries a prize of one million dollars: see 
www.ams.org/claymath/). Many field theorists have dreamed that at least “pure” 
QCD, that is QCD without quarks, might be exactly soluble. After all, if any 4- 
dimensional quantum field theory turns out to be exactly soluble, pure Yang-Mills, 
with all its fabulous symmetries, is the most likely possibility. (Perhaps an even 
more likely candidate for solubility is supersymmetric Yang-Mills theory. We will 
touch on supersymmetry in Chapter VII.4.) 

Let me be specific about what it means to solve QCD. Consider a world with 
only up and down quarks with m,, and m, both set equal to zero, namely a world 
described by 


L= ~ aa Fe + giy"Dyuq (10) 


The goal would be to calculate something like the ratio of the mass of the p meson 
m., to the mass of the proton m p. 

To make progress, theoretical physicists typically need to have a small param- 
eter to expand in, but in trying to solve (10) we are confronted with the immediate 
difficulty that there is no such parameter. You might think that g is a parameter, but 
you would be mistaken. The renormalization group analysis taught us that ¢ (2) is 
a function of the energy scale fz at which it is measured. Thus, there is no particu- 
lar dimensionless number we can point to and say that it measures the strength of 
QCD. Instead, the best we can do Is to point to the value of sz at which (¢g ()* /43) 
becomes of order 1. This is the energy, known as A gc p, at which the strong in- 
teraction becomes strong as we come down from high energy (Fig. VII.3.6). But 
Agcp merely sets the scale against which other quantities are to be measured. 
In other words, if you manage to calculate mp it better come out proportional to 
A gcp Since A gc p is the only quantity with dimension of mass around. Similarly 
for m,. Put in precise terms, if you publish a paper with a formula giving m ,/m p 
in terms of pure numbers such as 2 and z, the field theory community will hail 
you as a conquering hero who has solved QCD exactly. 
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g(t)? 


Figure VIL.3.6 


The apparent trade of a dimensionless coupling g for a dimensional mass scale 
Agcp is known as dimensional transmutation, of which we will see another 
example in the next chapter. 


Exercises 


VII.3.1. Calculate C in (9). [Hint: If you need help, consult T. Appelquist and H. Georgi, 
Phys. Rev. D8: 4000, 1973; and A. Zee, Phys. Rev. D8: 4038, 1973.] 


VII.3.2. Calculate (2). 


Chapter VII.4 


Large N Expansion 


Inventing an expansion parameter 


Quantum chromodynamics is a zero-parameter theory, so it is difficult to give 
even a first approximation. In desperation, field theorists invented a parameter in 
which to expand QCD. Suppose instead of three colors we have N colors. ’t Hooft! 
noticed that as N — co remarkable simplifications occur. The idea is that if we 
can calculate m,/m p, for example, in the large N limit the result may be close to 
the actual value. People sometimes joke that particle physicists regard 3 as a large 
number, but actually the correction to the large N limit is typically of order 1/.N7, 
abut 10% in the real world. Particle physicists would be more than happy to be 
able to calculate hadron masses to this degree of accuracy. 

As with spontaneous symmetry breaking and a number of other important con- 
cepts, the large N expansion came out of condensed matter physics but nowadays 
is used routinely in all sorts of contexts. For example, people have tried a large N 
approach to solve high-temperature superconductivity and to fold RNA. 


Scaling the QCD coupling 


So, let the color group be U(N) and write 
N? = 
L=-—s tk, FY + li(g—iA)— my (1) 
227 


Note that we have replaced g” by g*/N°. For finite N this change has no essential 
significance. The point is to choose the power a so that interesting simplifications 
occur in the limit N — oo with g? held fixed. The cubic and quartic interaction 
vertices of the gluons are proportional to N®. On the other hand, since the gluon 
propagator goes as the inverse of the quadratic terms in , it is proportional to 
1/N“%. The coupling of the gluon to the quark does not depend on N. 

To fix a, let us focus on a specific application, the calculation of o(ete~ > 
hadrons) discussed in the last chapter. Suppose we want to calculate this cross 


1G. *t Hooft, Under the Spell of the Gauge Principle, p. 378. 
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{a) (b) 


Figure VIL4.1 


section at low energies. Consider the two-gluon exchange diagrams shown in 
Figures VIL.4.la and b. The two diagrams are of order g* and we would have to 
calculate both. Note that 1b is nonplanar: Since one gluon crosses over the other, 
the diagram cannot be drawn on the plane if we insist that lines cannot go through 
each other. 

Now the double line formalism introduced in chapter IV.6 shines. In this for- 
malism the diagrams Figure VII.4. 1a and b are redrawn as in Figure VI.4.2a and b. 
The two gluon propagators common to both diagrams give a factor 1/N7*. Now 
comes the punchline. We sum over three independent color indices in 2a, thus get- 
ting a factor N?. Grab some crayons and try to color each line in 2a with a different 
color: you will need three crayons. In contrast, we sum over only one independent 
index in 2b, getting only a factor of N. In other words, 2a dominates 2b by a factor 
N*. In the large N limit we can throw 2b away. 

Clearly, the rule is to associate one factor of N with each loop. Thus, the lowest 
order diagram, shown in 2c, with N different colors circulating in it, scales as N; 
2a scales as N?/N?*. We want 2a and 2c to scale in the same way and thus we 
choose a = I. 


«(D> <> 


(a) 


(c) (d) 


Figure VIL4.2 
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By drawing more diagrams [e.g., 2d scales as N(1/N*)N*, with the three 
factors coming from the quartic coupling, the propagators, and the sum over colors, 
respectively], you can convince yourself that planar diagrams dominate in the large 
N limit, all scaling as N. For a challenge, try to prove it. Evidently, there is a 
topological flavor to ail this. 

The reduction to planar diagrams is a vast simplification but there are still an 
infinite number of diagrams. At this stage in our mastery of field theory, we still 
can’t solve large N QCD. (As I started writing this book, there were tantalizing 
clues, based on insight and techniques developed in string theory, that a solution 
of large N QCD might be within sight. As I now go through the final revision, that 
hope has faded.) 

The double-line formalism has a natural interpretation. Group theoretically, the 
matrix gauge potential Al, transforms just like g'q¢ ; (but assuredly we are not saying 
that the gluon is a quark-antiquark bound state) and the two lines may be thought of 
as describing a quark and an antiquark propagating along, with the arrows showing 
the direction in which color is flowing. 


Random matrix theory 


There is a much simpler theory, structurally similar to large N QCD, that actually 
can be solved. I am referring to random matrix theory. 

Exaggerating a bit, we can say that quantum mechanics consists of writing 
down a matrix known as the Hamiltonian and then finding its eigenvalues and 
eigenvectors. Jn the early 1950s, when confronted with the problem of studying 
the properties of complicated atomic nuclei, Eugene Wigner proposed that instead 
of solving the true Hamiltonian in some dubious approximation we might generate 
large matrices randomly and study the distribution of the eigenvalues——a sort of 
statistical quantum mechanics. Random matrix theory has since become a rich 
and flourishing subject, with an enormous and growing literature and applications 
to numerous areas of theoretical physics and even to pure mathematics (such as 
operator algebra and number theory.)” It has obvious applications to disordered 
condensed matter systems and less obvious applications to random surfaces and 
hence even to string theory. Here J will content myself with showing how ’t Hooft’s 
observation about planar diagrams works in the context of random matrix theory. 

Let us generate N by N hermitean matrices gy randomly according to the 
probability 


P(y) = oe ve) (2) 


* For a glimpse of the mathematical literature, see D. Voiculescu, ed., Free Probability 
Theory. 
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with V(@) a polynomial in g. For example, let V(g¢) = 5mo* + gy*. The nor- 
malization { Dg P(g) = 1 fixes 


Z= | dpe N™ V() (3) 


The limit N — oo is always understood. 

As in Chapter VI.7 we are interested in p(£), the density of eigenvalues of gy. 
To make sure that you understand what is actually meant, let me describe what 
we would do were we to evaluate o(E) numerically. For some large integer NV, 
we would ask the computer to generate a hermitean matrix g with the probability 
P(@) and then to solve the eigenvalue equation gv = Ev. After this procedure had 
been repeated many times, the computer could plot the distribution of eigenvalues 
in a histogram that eventually approaches a smooth curve, called the density of 
eigenvalues p(E). 

We already developed the formalism to compute p(£) in (VI.7.1): Compute 
the real analytic function G(z) = ((1/N) tr[1/z — g]) and p(E) = —(1/z) lim 
Im G(E + ie). The average (- - -) is taken with the probability P(g): a0 


(O@) == | De No) 


You see that my choice of notation, g for the matrix and V(g) = 47? + go 
as an example, is meant to be provocative. The evaluation of Z is just like the 
evaluation of a path integral, but for an action S(g) = N tr V(¢) that does not 
involve f d¢x. Random matrix theory can be thought of as a quantum field theory 
in (0 + 0)-dimensional spacetime! 

Various field theoretic methods, such as Feynman diagrams, can all be applied 
to random matrix theory. But life is sweet in (0 + 0)-dimensional spacetime: There 
is NO space, no time, no energy, and no momentum and hence no integral to do in 
evaluating Feynman diagrams. 


The Wigner semicircle law 


Let us see how this works for the simple case V(v) = 1m?g? (we can always 
absorb m into g but we won’t). Instead of G(z), it is slightly easier to calculate 


inv eK 1 \\_, 
ao=((2) }-so 


The last equality follows from invariance under unitary transformations: 


P(g) = P(U'gU) (4) 
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Figure VII.4.3 


Expand 


zens 


ie.) 
, 1 ; 
Gi) =) =") (5) 
n=0 
Do the Gaussian integral 


! —NtrAmy? i 1 | “Nom? gret t t  gigt | 


Setting k = / and summing, we find the n = 1 term in (5) is equal to (1/z7)84 (1/m?). 

Just as in any field theory we can associate a Feynman diagram with each of the 
terms in (5). For the n = | term, we have Figure VIE.4.3. The matrix character of g 
lends itself naturally to ’t Hooft’s double-jine formalism and thus we can speak of 
quark and gluon propagators with a good deal of ease. The Feynman rules are given 
in Figure VII.4.4. We recognize ¢ as the gluon field and (6) as the gluon propagator. 
Indeed, we can formulate our problem as follows: Given the bare quark propagator 
1/z, compute the true quark propagator G (z) with all interaction effects taken into 
account. 


Le 


Figure VII.4.4 
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(e) 


Figure VIL4.5 


Let us now look at the n = 2 term in (5) 1/z° < gigr of¢; >, which we represent 


in Figure VII.4.5a. With a bit of thought you can see that the index i can be 
contracted with k, J, or j, thus giving rise to Figures VII.4.5b, c, d. Summing 
over color indices, just as in QCD, we see that the planar diagrams in 5b and 5d 
dominate the diagram in 5c by a factor N*. We can take over ’t Hooft’s observation 
that planar diagrams dominate. 
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Incidentally, in this example, you see how large N is essential, allowing us to 
get rid of nonplanar diagrams. After all, if I ask you to calculate the density of 
eigenvalues for say N = 7 you would of course protest saying that the general 
formula for solving a degree-7 polynomial equation is not even known. 

The simple example in Figure VII.4.5 already indicates how all possible dia- 
grams could be constructed. In Sb the same “unit” 1s repeated, while in 5d the same 
“unit” is nested inside a more basic diagram. A more complicated example is shown 
in 5e. You can convince yourself that for N = oo all diagrams contributing to G(z) 
can be generated by either “nesting” existing diagrams inside an overarching gluon 
propagator or “repeating” an existing structure over and over again. Translate the 
preceding sentence into two equations: “Repeat” (see Figure VII.4.6a), 


G(z) = ! + lyw@l + Lyme! 4+... 
zZ 2 ZZ Zz Zz 


1 


_— 7 
z— Xz) oe 
and “nest” (see Figure VII.4.6b), 
1 
X(z) = — G() (8) 
m 


Combining these two equations we obtain a simple quadratic equation for G(z) 
that we can immediately solve to obtain 


(6) - + + + 


+ eeees 


(b) 


Figure VII.4.6 
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Se 


Figure VIL4.7 


2 
G(z) = > (: —f/Z2— 5) (9) 


(From the definition of G(z) we see that G(z) — 1/z for large z and thus we choose 
the negative root.) We immediately deduce that 


p(E) =~. Va2 — (10) 


maz 


where a? = 4/m?. This is a famous result known as Wigner’s semicircle law. 


The Dyson gas 


I hope that you are struck by the elegance of the large N planar diagram approach. 
But you might have also noticed that the gluons do not interact. It is as if we have 
solved quantum electrodynamics while we have to solve quantum chromodynam- 
ics. What if we have to deal with V(g¢) = smo? + 2"? The gq‘ term causes the 
gluons to interact with each other, generating horrible diagrams such as the one in 
Figure VII.4.7. Clearly, diagrams proliferate and as far as I know nobody has ever 
been able to calculate G(z) using the Feynman diagram approach. 

Happily, G(z) can be evaluated using another method known as the Dyson gas 


approach. The key is to write 
yg =U'AU (1) 


where A denotes the N by N diagonal matrix with diagonal elements equal to 4,, 
i=1,..., N. Change the integration variable in (3) from g to U and A: 


Z= | dU J (I1,da,) Je" La VOW) (12) 


with J the Jacobian. Since the integrand does not depend on U we can throw 
away the integral over U. It just gives the volume of the group SU(N). Does this 
remind you of chapter VII.1? Indeed, in (11) U corresponds to the unphysical 
gauge degrees of freedom—the relevant degrees of freedom are the eigenvalues 
{A;}. As an exercise you can use the Faddeev-Popov method to calculate J. 
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Instead, we will follow the more elegant tack of determining J by arguing 
from general principles. The change of integration variables in (11) is ill defined 
when any two of the A,’s are equal, at which point J must vanish. (Recall that 
the change from Cartesian coordinates to spherical coordinates is ill defined at 
the north and south poles and indeed the Jacobian in sin 6d@dq@ vanishes at 
@ = 0 and 7.) Since the A,;’s are created equal, interchange symmetry dictates 
that J = [I yjgn Am — A JF. The power f can be fixed by « dimensional analysis. 
With N? matrix elements dg obviously has dimension aN? while (T1,d4;)J has 
dimension AN AfNA—-D/2; thus B = 2. 

Having determined J, let us rewrite (12) as 


—N Vay) 
Z= [tar )0Tp On ~ AnPe » 


- 4/2 
= f mare Nye VOWD+ > omen log((Am—An) (13) 


Dyson pointed out that in this form Z = { (T1,da,;)e7N#@1-----4w) is just the par- 
tition function of a classical 1-dimensional gas (recall Chapter V.2). Think of 2,, 
a real number, as the position of the ith molecule. The energy of a configuration 


E(y,...,Ay) = » VA) — i. SY) log(am — An)? (14) 
msén 


consists of two terms with obvious physical interpretations. The gas is confined 
in a potential well V(x) and the molecules repel? each other with the two-body 
potential —(1/N) log(x — y)?. Note that the two terms in E are of the same order 
in N since each sum counts for a power of NV. In the large N limit (we can think 
of N as the inverse temperature), we evaluate Z by steepest descent and minimize 
E, obtaining 


1 
Va)== > — (15) 


which in the continuum limit, as the poles in (15) merge into a cut, becomes 
VV’) =2P f dule()/(A — 2)], where p(z) is the unknown function we want 
to solve for and P denotes principal value. 

Defining as before G(z) = f du[p(u)/(A — )] we see that our equation for 
p(w) can be written as Re G(A + ie) = 4V’(A). In other words, G(z) is a real 
analytic function with cuts along the real axis. We are given the real part of G(z) 
on the cut and are to solve for the imaginary part. Brézin, Itzykson, Parisi, and 
Zuber have given an elegant solution of this problem. Assume for simplicity that 


3Note that this corresponds to the repulsion between energy levels in quantum 
mechanics. 
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V (z) is an even polynomial and that there is only one cut (see Exercise VII.4.7). 
Invoke symmetry and postulate the form 


G@=4[V'@ - P@V2-a| 


with P(z) anunknown even polynomial. Remarkably, the requirement G(z) —> 1/z 
for large z completely determines P(z). Pedagogically, it is clearest to go toa 
specific example, say V(z) = 4m?7z? + gz’. Since V’(z) is a cubic polynomial in 
z, P(z) has to be a quadratic (even) polynomial in z. Taking the limit z -> co 
and requiring the coefficients of z> and of z in G(z) to vanish and the coefficient 
of 1/z to be | gives us three equations for three unknowns [namely a and the 
two unknowns in P(z)]. The density of eigenvalues is then determined to be 
p(E) = /n) P(E) a? — E?. 

I think the lesson to take away here is that Feynman diagrams, in spite of their 
historical importance in quantum electrodynamics and their usefulness in helping 
us visualize what is going on, are vastly overrated. Surely, nobody imagines 
that QCD, even large N QCD, will one day be solved by summing Feynman 
diagrams. What is needed is the analog of the Dyson gas approach for large N 
QCD. Conversely, if a reader of this book manages to calculate G(z) by summing 
planar diagrams (after all, the answer is known!), the insight he or she gains might 
conceivably be useful in seeing how to deal with planar diagrams in large N QCD. 


Field theories in the large N limit 


A number of field theories have also been solved in the large N expansion. I will 
tell you about one example, the Gross-Neveu model, partly because it has some of 
the flavor of QCD. The model is defined by 


N 2 {N - 2 
SQ) = f Px |S? Vata t+ | Yo Vata (16) 
a=l 2N a=l 


Recall from Chapter IIE.3 that this theory should be renormalizable in (1+ 1)- 
dimensional spacetime. For some finite V, say N = 3, this theory certainly appears 
no easier to solve than any other fully interacting field theory. But as we will see, 
as N —» oo we can extract a lot of interesting physics. 

Using the identity (A.14) we can rewrite the theory as 


N 
S,0) = J d°x Ly Valif — 0) ba — A] (17) 


a=1 


By introducing the scalar field o (x) we have “undone” the four-fermion interac- 
tion. (Recall that we used the same trick in Chapter III.5.) You will note that the 
physics involved is similar to that behind the introduction of the weak boson to 
generate the Fermi interaction. Using what we learned in Chapters II.5 and IV.3 
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we can immediately integrate out the fermion fields to obtain an action written 
purely in terms of the o field 


S(a)=— il Px a? —iN tr logtig — a) (18) 
297 


Note the factor of N in front of the tr log term coming from the integration over NV 
fermion fields. With the malice of forethought we, or rather Gross and Neveu, have 
introduced an explicit factor of 1/N in the coupling strength in (16), so that the 
two terms in (18) both scale as N. Thus, the path integral Z = f Doe'*©) may be 
evaluated by the steepest descent or stationary phase method in the large N limit. 
We simply extremize S(o). 

Incidentally, we can see the judiciousness of the choice a = | in large N QCD 
in the same way. Integrating out the quarks in (1) we get 


S=— / atx ty F,,)F#" + N tr log(i(g — iA) — m) 
297 


and thus the two terms both scale as NW and can balance each other. The increase in 
the number of degrees of freedom has to be offset by a weakening of the coupling. 

To study the ground state behavior of the theory, we restrict our attention to field 
configurations o (x) that do not depend on x. (In other words, we are not expecting 
translation symmetry to be spontaneously broken.) We can immediately take over 
the result you got in Exercise [V.3.3 and write the effective potential 


+ VV) — a7 + ty (1 or 3) (19) 
N 22()? An . we 


We have imposed the condition (1/N)[d?V (0) /do*]l,-,, = 1/g()” as the defini- 


tion of the mass scale dependent coupling g (sz) (compare IV.3.18). The statement 
that V(o) is independent of 2 immediately gives 


1 1 1 
> = = — log # (20) 
guy gtw’)e a 


As 2 -> 00, g(y2) — 0. Remarkably, this theory is asymptotically free, just like 
QCD. If we want to, we can work backward to find the flow equation 


_t 


= e(uyt+--- (21) 


d 
Bb ae = 
The theory in its different incarnations, (16), (17), and (18), enjoys a discrete 
Z» symmetry under which 7, > y°w, and o — —o. As in Chapter IV.3, this 
symmetry is dynamically broken by quantum fluctuations. The minimum of V (a) 
occurs at O.n;, = e!~7/84)” and so according to (17) the fermions acquire a mass 


Mf = Opin = wel 7/8” (22) 
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Note that this highly nontrivial result can hardly be seen by staring at (16) and 
we have no way of proving it for finite NV. In the spirit of the large N approach, 
however, we expect that the fermion mass might be given by m,; = ywe!—7/8 (ay 4 
O(U/N 2) so that (22) would be a decent approximation even for say, N = 3. Since 
my is physically measurable, it better not depend on yz. You can check that. 

This theory also exhibits dimensional transmutation as described in the previous 
chapter. We start out with a theory with a dimensionless coupling g and end up 
with a dimensional fermion mass m ;. Indeed, any other quantity with dimension 
of mass would have to be equal to m times a pure number. 


Dynamically generated kinks 


I discuss the existence of kinks and solitons in chapter V.1. You clearly under- 
stood that the existence of such objects follows from general considerations of 
symmetry and topology, rather than from detailed dynamics. Here we have a 
(1 + 1)-dimensional theory with a discrete Z, symmetry, so we certainly expect a 
kink, namely a time independent configuration o (x) (henceforth x will denote only 
the spatial coordinate and will no longer label a generic point in spacetime) such 
that 0(—0o) = —Opjn and o(+00) = Opin. [Obviously, there is also the antikink 
with o(—0co) = Opin and o (+00) = —oni-] 

At first sight, it would seem almost impossible to determine the precise shape 
of the kink. In principle, we have to evaluate tr log[i# — o(x)] for an arbitrary 
function o (x) such that o(+co) = —o (—0o) (and as I explained in Chapter IV.3, 
this involves finding all the eigenvalues of the operator iJ — o (x), summing over 
the logarithm of the eigenvalues), and then varying this functional of o (x) to find 
the optimal shape of the kink. 

Remarkably, the shape can actually be determined thanks to a clever observa- 
tion.* In analogy with the steps leading to (IV.3.24) we note that 


tr loglig — 0 (x)] = tr log y [i — o (x) ]y? = tr log(—N[i7 + o(x)] 
and thus up to an irrelevant additive constant 


tr log(ig — o(x)) = 5 tr logli9 — o (x) li +o (x)] 


=4trlog{-a+iy'o’@)-fo@P} 3) 


4C. Callan, $. Coleman, D. Gross, and A. Zee, (unpublished). See D. J. Gross, “Appli- 
cations of the Renormalization Group to High-Energy Physics,” in: R. Balian and J. Zinn- 
Justin, eds., Methods in Field Theory, p. 247. By the way, I recommend this book to students 
of field theory. 
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Since y! has eigenvalues +i, this is equal to 
5 {tr log{—8* + o’(x) — [a (x) 7} 4 tr log{—a? — o’(x) — [o «PI 
but these two terms are equal by parity (space reflection) and hence 
tr loglig — o(x)] = tr log(—a? — 0'(x) — [o@)P} 


Referring to (18) we see that S(o) is the sum of two terms, a term quadratic in 
a(x) and a term that depends only on the combination o’(x) + [a (x)?. But we 
know that o,,;, minimizes S(o). Thus, the soliton is given by the solution of the 
ordinary differential equation 


a(x) + fo) = O.. (24) 


namely a(x) = d,y;, tanh o,,;,,*. The soliton would be observed as an object of 
size 1/Omin = 1/m p- I leave it to you to show that its mass is given by 


ms = ~ mg (25) 


Precisely as theorized in the last chapter, the ratio m ;>/m, comes out to be a pure 
number, N /7, as it must. 

By an even more clever method that I do not have space to describe, Dashen, 
Hasslacher, and Neveu were able to study time dependent configurations of o and 
determine the mass spectrum of this model. 


Exercises 


VII.4.1. Since the number of gluons only differs by one, it is generally argued that it does 
not make any difference whether we choose to study the U(NV) theory or the 
SU(N) theory. Discuss how the gluon propagator in a U/(N) theory differs from 
the gluon propagator in an SU(N) theory and decide which one is easier. 


VIL4.2. Asa challenge, solve large N QCD in (1 + 1)-dimensional spacetime. [Hint: The 
key is that in (1 + 1)-dimensional spacetime with a suitable gauge choice we can 
integrate out the gauge potential A,,.] For help, see ’t Hooft, Under the Spell of 
the Gauge Principle, p. 443. 


VII.4.3. Show that if we had chosen to calculate G(z) = ((1/N) tr(1/z — @)), we would 
have to connect the two open ends of the quark propagator. We see that Figures 
VII.4.5b and d lead to the same diagram. Complete the calculation of G(z) in this 
way. 


VII.4.4. Suppose the random matrix ¢ is real symmetric rather than hermitean. Show that 
the Feynman rules are more complicated. Calculate the density of eigenvalues. 
{Hint: The double line propagator can twist.] 
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VIL4.5. 


VIL.4.6. 


VIL4.7. 


VIL.4.8. 
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For hermitean random matrices y, calculate 


1 1 1 1 1 1 1 1 
G.(z, w) =(— tt ——_—t —-(—t — tr —— 
N z-@gN w-o@ N z~@/\N w-¢@ 
for V(g) = 4m? using Feynman diagrams. [Note that this is a much simpler 
object to study than the object we need to study in order to learn about localization 
(see Exercise VI.6.1.] Show that by taking suitable imaginary parts we can extract 


the correlation of the density of eigenvalues with itself. For help, see E. Brézin 
and A. Zee, Phys. Rev. E51: p. 5442, 1995. 


Use the Faddeev-Popov method to calculate J in the Dyson gas approach. 


For V(g) = 3m 20? + gy*, determine p(E). For m? sufficiently negative (the 
double well potential again) we expect the density of eigenvalues to split into two 
pieces. This is evident from the Dyson gas picture. Find the critical value m?. For 
m? < m? the assumption of G(z) having only one cut used in the text fails. Show 
how to calculate (£) in this regime. 


Calculate the mass of the soliton (25). 


Chapter VII.5 


Grand Unification 


Crying out for unification 


A gauge theory is specified by a group and the representations the matter fields 
belong to. Let us go back to Chapter VII.2 and make a catalogue for the SU (3) @ 
SU (2) @ U(\) theory. For example, the left handed up and down quarks are in a 
doublet (4a ) ,, With hypercharge ¥ = 3. Let us denote this by (3, 2, 3),, with 
the three numbers indicating how these fields transform under SU (3) ® SU (2) @ 
U (1). Similarly, the right handed up quark is (3, 1, 3). Writing it all down, we 
see that the quarks and leptons of each family are placed in 


3.2, 1,31 Dr 3.1, -PeR U2,-F)z, and(L1,-De =D 


This motley collection of representations practically cries out for further unifi- 
cation. Who would have constructed the universe by throwing this strange looking 
list down? 

What we would like to have is a larger gauge group G containing SU (3) @ 
SU (2) @ U (1), such that this laundry list of representations is unified into (ideally) 
one great big representation. The gauge bosons in G [but not in SU (3) ® SU(2) @ 
U (1) of course] would couple the representations in (1) to each other. 

Before we start searching for G, note that since gauge transformations 
commute with the Lorentz group, these desired gauge transformations cannot 
change left handed fields to right handed fields. So let us change all the fields 
in (1) to left handed fields. Recall from Exercise [1.1.8 that charge conjugation 
changes left handed fields to right handed fields and vice versa. Thus, instead of 
(1) we can write 


(3, 2, 2), (3", 1, -3), BL 4), CL, 2, -§), and (1, 1, D (2) 


We now omit the subscripts L and R: everybody is left handed. 


A perfect fit 


The smallest group that contains SU (3) @ SU(2) ® U(\) is SU(5). Uf you are 
shaky about group theory, study Appendix C now.) Recall that SU (5) has 5* — 1= 


391 


392 VII Grand Unification 


24 generators. Explicitly, the generators are represented by 5 by 5 hermitean 
traceless matrices acting on five objects we denote by wy“ with y= 1, 2,..., 5. 
[These five objects form the fundamental or defining representation of SU (5).] 

It is now obvious how we can fit SU(3) and SU(2) into SU(5). Of the 24 
matrices that generate SU(5), eight have the form (4 5) and three the form 
( 4 2 ), where A represents 3 by 3 hermitean traceless matrices (of which there are 
3? — 1 =8, the so-called Gell-Mann matrices) and B represents 2 by 2 hermitean 
traceless matrices (of which there are 27 — 1 = 3, namely the Pauli matrices). 
Clearly, the former generate an SU (3) and the latter an SU (2). Furthermore, the 
5 by 5 hermitean traceless matrix 


-; 0 000 
0 -} 0 0 0 
sY=| 0 0 -4 0 0 (3) 
0 0 0 5 0 
0 0 0 0 3 


generates a U (1). Without being coy about it, we have already called this matrix 
the hypercharge +Y. 

In other words, if we separate the index yp = {a, i) witha = 1, 2, 3andi =4, 5, 
then the SU'(3) acts on the index a and the SU (2) acts on the index i. Thus, the 
three objects 7° transform as a 3-dimensional representation under SU (3) and 
hence could be a 3 or a 3*. Let us choose ~® as transforming as 3; we will see 
shortly that this is the right choice with Y/2 given as in (3). The three objects wr” 
do not transform under SU (2) and hence each of them belongs to the singlet 1 
representation. Furthermore, they carry hypercharge — 4 as we can read off from 
(3). To sum up, ¥* transform as (3, 1, ~4) under SU(3) @ SU (2) @ U (1). On the 
other hand, the two objects / transform as 1 under SU (3) and 2 under SU (2), and 
carry hypercharge 4; thus they transform as (1, 2, 4). In other words, we embed 
SU (3) @ SU (2) @ UCD) into SU (5) by specifying how the defining representation 
of SU(5) decomposes into representations of SU(3) @ SU(2) @ U(1) 


5 —> (3, 1, -3) @ (1, 2, 5) (4) 
Taking the conjugate we see that 
5* > (3*, 1, 4) ® (1, 2, —4) (6) 
Inspecting (2), we see that (3*, 1, 4) and (1, 2, —}) appear on the list. We are 
on the right track! The fields in these two representations fit snugly into 5*. 
This accounts for five of the fields contained in (2); we still have the ten fields 
(3, 2, 2), (3*, 1, — 9), and (1, 1, 1) (6) 


Consider the next representation of SU(5) in order of size, namely the anti- 
symmetric tensor representation y“” . Its dimension is (5 x 4)/2 = 10. Precisely 
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the number we want, if only the quantum numbers under SU (3) @ SU(2) @ U(1) 
work out! 

Since we know that 5 — (3, 1, — 3) ® (1, 2, 4), we simply (again, see Appendix 
C!) have to work out the antisymmetric product of (3, 1, —+) (1, 2, 3) withitself, 
namely the direct sum of (where @ 4 denotes the antisymmetric product) 


(3, 1, -$) @4 3, 1, -4) = G*, 1 -) 7) 
(3, 1, -—3) @4 0, 2, 3) = G2, -4+ P=, 2, 2) (8) 

and 
(1, 2, 3) @4 (2, 9) =, 1D (9) 


[I will walk you through (7): In SU (3) 3 @4 3 = 3* (@emember ¢; ;, from Appendix 
C?), in SU (2) 1@, 1= 1, andin U(1) the hypercharges simply add —4 — 1 = —2.] 


Lo and behold, these SU (3) ®@ SU(2) @ U(1) representations form exactly the 
collection of representations in (6). In other words, 


10> 3,2,)063*,L,-)6 (LLD (10) 


The known quark and lepton fields in a given family fit perfectly into the 5* and 
10 representations of SU (5)! 

I have just described the SU (5) grand unified theory of Georgi and Glashow. In 
spite of the fact that the theory has not been directly verified by experiment, it is 
extremely difficult for me and for many other physicists not to believe that SU (5) 
is at least structurally correct, in view of the perfect group theoretic fit. 

It is often convenient to display the contents of the representation 5* and 10, 
using the names given to the various fields historically. We write 5* as a column 
vector 


dy 
t=(i)= y (11) 


and the 10 as an antisymmetric matrix 


pe? = (we wi) 


0 “ue-k u ad 
—t OF Kk u d (12) 
=| «zt -u O u d 
—u —u -u 0 €@ 
—-d -d -d -é 0 


(I suppressed the color indices.) 
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Deepening our understanding of physics 


Aside from its esthetic appeal, grand unification deepens our understanding of 
physics enormously. 

1, Ever wondered why electric charge is quantized? Why don’t we see particles 
with charge equal to ./7 times the electron’s charge? In quantum electrodynamics, 
you could perfectly well write down 


L=wli(a — id) — my + w'li(A — iva A) —m'W'+--- (13) 


In contrast, in grand unified theory A,, couples to a generator of the grand 
unifying gauge group, and you know that the generators of any simple group such 
as SU(N) (a group not given by the direct product of U (1) with other groups is said 
to be simple) are forced by the nontrivial commutation relations [T,,, T7,] = ifopcT- 
to assume quantized values. For example, the eigenvalues of 7; in SU (2), which 
depend on the representation of course, must be multiples of 4. Within SU (3) 
x SU(2) x U(1), which is not a simple group, we cannot understand charge 
quantization: The generator of U(1) is not quantized. But upon grand unification 
into SU (5) (or more generally any simple group) electric charge is quantized. 

The result here is deeply connected to Dirac’s remark (Chapter IV.4) that electric 
charge is quantized if the magnetic monopole exists. We know from Chapter V.8 
that spontaneously broken nonabelian gauge theories such as the SU(5) theory 
contain the monopole. 

2. Ever wondered why the proton charge is exactly equal and opposite to the 
electron charge? This important fact allows us to construct the universe as we know 
it. Atoms must be electrically neutral to some fantastic degree of accuracy for 
standard cosmology to work; otherwise, electrostatic forces between macroscopic 
matter would tear the universe apart. 

This remarkable fact is nicely incorporated into SU (5). It is fun to see how it 
goes. Evaluating tr @ = 0 over the 5* implies that 39; = — Q,-. I have used the 
fact that the strong interaction commutes with electromagnetism and hence quarks 
with different color have the same charge. Now let us calculate the proton charge 


Qp: 
Op =20,4+ Og=2(0g+) 4+ Gg =30g4+2=0,-+2 (14) 


If Q,- = —1, then Qp = —Q,-, as is indeed the case! 

3. Recall that in electroweak theory we defined tan 0 = g/g, with the coupling 
of the gauge bosons 82 AtT, + g,B,(¥/2). Since the normalization of At and 
B,, is fixed by their respective kinetic energy term, the relative strength of g> 
and g, is determined by the normalization of Y/2 relative to T;. Let us evaluate 
tr T? and tr(Y /2)" on the defining representation 5 : tr T? = Gy + (3)? =; and 
tr(¥/2)? = G3 + GY2 =}. 

Thus, 7; and ./3/5(Y /2) are normalized equally. So the correct grand unified 
combination is A® 7, + B,/3/5(Y/2), and therefore tan 0 = g/g. = /3/5 or 
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3 
- 2 
sin’ @ = — 15 
3 (15) 


at the grand unification scale. To compare with the experimental value of sin? @ we 
would have to study how the couplings gy and g, flow under the renormalization 
group down to low energies. We will postpone this discussion until the next chapter. 


Freedom from anomaly 


Recall from Chapter VII.2 that the key to proving renormalizability of nonabelian 
gauge theory is the ability to pass freely between the unitary gauge and the R; 
gauge. The crucial ingredient is gauge invariance and the resulting Ward-Takahashi 
identities (see Chapter II.7). 

Suddenly you start to worry. What about the chiral anomaly? The existence 
of the anomaly means that some Ward-Takahashi identities fail to hold. For our 
theories to make sense, they had better be free from anomalies. I remarked in 
Chapter IV.7 that the historical name “anomaly” makes it sound like some kind of 
sickness. Well, in a way, it is. 

We should have already checked the SU (3) ® SU(2) @ U (I) theory for anoma- 
lies, but we didn’t. I will let you do it as an exercise. Here I will show that the 
SU (5) theory is healthy. If the SU (5) theory is anomaly-free, then a fortiori so is 
the SU(3) @ SU(2) ® U()) theory. 

In Chapter IV.7 I computed the anomaly in an abelian theory but as I remarked 
there clearly all we have to do to generalize to a nonabelian theory is to insert a 
generator T,, of the gauge group at each vertex of the triangle diagram in Figure 
IV.7.1. Summing over the various fermions running around the loop, we see that 
the anomaly is proportional to A,,,(R) = tr(T,{T,, T.}), where R denotes the 
representation to which the fermions belong. We have to sum A,,,(R) over all 
the representations in the theory, remembering to associate opposite signs to left 
handed and right handed fermion fields. (It may be helpful to remind yourself of 
remark 3 and Exercise 6 in Chapter IV.7.) 

We are now ready to give the SU(5) theory a health check. First, all fermion 
fields in (2) are left handed. Second, convince yourself (simply imagine calculating 
Aape for all possible abc) that it suffices to set T,, T,, and T, all equal to 


200 0 9 
020 0 0 
T=]0 02 0 0 
000 -3 O 
000 0 -3 


a multiple of the hypercharge. Let us now evaluate tr T° on the 5* representation, 


tr T|5« = 3(—2)° + 2(43)° = 30 (16) 
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and on the 10, 


tr T?h49 = 3(+4)* + 6(-1)° + (-6)* = —30 (17) 


An apparent miracle! The anomaly cancels. 

This remarkable cancellation between sums of cubes of a strange list of numbers 
suggests strongly, to say the least, that $U/(5) is not the end of the story. Besides, 
it would be nice if the 5* and 10 could be unified into a single representation. 


VII.5.1. 


VIL5.2. 


VIL5.4. 


Exercises 
Write down the charge operator Q acting on 5, the defining representation yr“. 
Work out the charge content of the 10 = y“” and identify the various fields 


contained therein. 


Show that for any grand unified theory, as long as it is based on a simple group, 
we have at the unification scale 


(18) 


sin? 6 = 


Do? 


where the sum is taken over all fermions. 


. Check that the SU (3) ® SU(2) ® U() theory is anomaly-free. [Hint: The calcu- 


lation is more involved than in SU (5) since there are more independent generators. 
First show that you only have to evaluate tr Y{T,,, T,,} and tr Y?, with T,, and Y 
the generators of SU (2) and U (1), respectively.] 


Construct grand unified theories based on SU(6), SU(7), SU (8), ..., until you 
get tired of the game. People used to get tenure doing this. [Hint: You would have 
to invent fermions yet to be experimentally discovered.] 


Chapter VII.6 


Protons Are Not Forever 


Proton decay 


Charge conservation guarantees the stability of the electron, but what about the 
stability of the proton? Charge conservation allows p —> 2° + e+. No fundamental 
principle says that the proton lives forever, but yet the proton is known for its 
longevity: It has been around essentially since the universe began. 

The stability of the proton had to be decreed by an authority figure: Eugene 
Wigner was the first to proclaim the law of baryon number conservation. The story 
goes that when Wigner was asked how he knew that the proton lives forever he 
quipped, “I can feel it in my bones.” I take the remark to mean that just from the 
fact that we do not glow in the dark we can set a fairly good lower bound on the 
proton’s life span. 

As soon as we start grand unifying, we better start worrying. Generically, 
when we grand unify we put quarks and leptons into the same representation of 
some gauge group [see (VII.5.11 and VIL5.12)]. This miscegenation immediately 
implies that there are gauge bosons transforming quarks into leptons and vice versa. 
The bag of three quarks known as the proton could very well get turned into leptons 
upon the exchange of these gauge bosons. In other words, the proton, the rock on 
which our world is built upon, may not be forever! Thus, grand unification runs 
the risk of being immediately falsified. 

Let My denote generically the masses of those gauge bosons transforming 
quarks into leptons and vice versa. Then the amplitude for proton decay is of 
order g*/M2, with g the coupling strength of the grand unifying gauge group, 
and the proton decay rate I is given by (g?/M2)? times a phase space factor 
controlled essentially by the proton mass m p since the pion and positron masses 
are negligible compared to the proton mass. By dimensional analysis, we determine 
that ~ (g?/ M2) m3, . Since the proton is known to live for something like at least 
107! years, My had better be huge compared to the kind of energy scales we can 
reach experimentally. 

The mass My is of the same order as the mass scale Mayr at which the 
grand unified theory is spontaneously broken down to SU(3) @ SU (2) @ U()). 
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Lo 


Figure VIL6.1 


Specifically, in the SU(5) theory, a Higgs field H* transforming as the adjoint 
24, with its vacuum expectation value (Hf) equal to the diagonal matrix with 
elements (—}, —4, —4, 4, 4) times some v, can do the job, as was discussed in 
Chapter IV.6. The gauge bosons in SU(3) @ SU (2) @ U (1) remain massless while 
the other gauge bosons acquire mass My of order gv. 

To determine Mayr, we apply renormalization group flow to g3, g, and g), the 
couplings of SU (3), SU (2), and U (1), respectively. The idea is that as we move 
up in the mass or energy scale yz the two asymptotically free couplings g3(j) and 
8>(j2) decrease while g)(2) increases. Thus, at some mass scale Mgyr they will 
meet and that is where SU (3) ® SU(2) @ U (1) is unified into SU (5) (see Figure 
VIL.6.1). Because of the extremely slow logarithmic running (it should be called 
walking or even crawling but again for historical reasons we are stuck withrunning) 
of the coupling constant, we anticipate that the unification mass scale Mayr will 
come out to be much larger than any scale we were used to in particle physics prior 
to grand unification. In fact, Mgyy will turn out to have an enormous value of the 
order 10'4—!5Gev and the idea of grand unification passes its first hurdle. 


Stability of the world implies the weakness of electromagnetism 


Using the result of exercise (VI.8.1) we obtain (here ag = g3/4m and agyr = 
g’/4x denote the strong interaction and grand unification analog of the fine 
structure constant a, respectively, with F the number of families) 
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4n 1 1 1 Mout 
eS = —— + — (4F — 33) log —— (1) 
[e3(u) Asi) Agyy 67 ph 

tnd 

~ 5 = sn @@) 1 + + ar — 22) log Mout (2) 

[go(H)] a(y) Qgur on i" 
2 
3_ 4m _ _ 300s OH) _ | Ny pigg Mout (3) 


S{gi)P 5 a(e) Aur 6m 


By @(j4) we mean the value of 6 at the scale uw. At 4 = Mayr, the three couplings 
are related through SU (5). 

We evaluate these equations for some experimentally accessible value of y, 
plugging in measured values of as and aw. With three equations, we not only 
manage to determine the unification scale Mgyy and coupling agyr, but we can 
predict @. In other words, unless the ratio g; to g> is precisely right, the three lines 
in Figure VII.6.1 will not meet at one point. 

Note that the number of fermion families F contributes equally to (1), (2), 
and (3). This is as it should be since the fermions are effectively massless for the 
purpose of this calculation and do not “know” that the unifying group has been 
broken into SU(3) @ SU (2) @ U1). These equations are derived assuming that 
all fermion masses are small compared to p. 

Rearranging these equations somewhat, we find 


sin? = 1 4 22 (4) 
9ars (Ht) 

2 

si Fg A it log Mout (5) 

a(u) ao(u) On 
1 81 1 (32 Mout 
—— = — —_ + — ( — F — 22) log —S 6 
a(H) 3aGur ax (3 ) ar ©) 


We obtain in (4) a prediction for sin? @(2) independent of Mgur and of the 
number of families. 
Note that (5) gives the bound 


a > ta log Mout (7) 
a(u) ~ 67 ue 


A lower bound on the proton lifetime (and hence on Mgy7) translates into an upper 
bound on the fine structure constant. Amusingly, the stability of the world implies 
the weakness of electromagnetism. 

As I noted earlier, plugging in the measured value of as, we obtain a huge value 
for Mayr. I regard this as a triumph of grand unification: Mgyy could have come 
out to have a much lower scale, leading to an immediate contradiction with the 
observed stability of the proton, but it didn’t. Another way of looking at it is that 
if we are somehow given Moyy and egyr, grand unification fixes the couplings 
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of all three nongravitational interactions! The point is not that this simplest try at 
grand unification doesn’t quite agree with experiment: The miracle is that it works 
at all. 

It is beyond the scope of this book to discuss in detail the comparison of (4), 
(5), and (6) with experiment. To do serious phenomenology, one has to include 
threshold effects (see Exercise VII.6.1), higher order corrections, and so on. To 
make a long story short, after grand unified theory came out there was enormous 
excitement over the possibility of proton decay. Alas, the experimental lower 
bound on the proton lifetime was eventually pushed above the prediction. This 
certainly does not mean the demise of the notion of grand unification. Indeed, as 
I mentioned earlier, the perfect fit is enough to convince most particle theorists of 
the essential correctness of the idea. Over the years people have proposed adding 
various hypothetical particles to the theory to promote proton longevity. The idea is 
that these particles would affect the renormalization group flow and hence Mayr. 
The proton lifetime is actually not the most critical issue. With more accurate 
measurements of a, and of @, it was found that the three couplings do not quite 
meet at a point. Indeed, for believers in low energy supersymmetry, part of their 
faith is founded on the fact that with supersymmetric particles included, the three 
coupling constants do meet.! But skeptics of course can point to the extra freedom 
to maneuver. 


Branching ratios 


You may have realized that (1), (2), and (3) are not specific for SU (5): they hold 
as long as SU (3) ®@ SU(2) ® U()) is unified into some simple group (simple so 
that there is only one gauge coupling g). 

Let us now focus on SU (5). Recall that we decompose the $U(5) index yp, 
which can take on five values, into two types. In other words, the index jz is labeled 
by {a, i}, where a takes on three values and i takes on two values. The gauge 
bosons in SU (5) correspond to the 24 independent components of the traceless 
hermitean field A* (u, v = 1, 2, . . . , 5) transforming as the adjoint representation. 
Focusing on the group theory of SU (5), we will suppress Lorentz indices, spinor 
indices, etc. Clearly, the eight gauge bosons in SU (3) transform an index of type a 
into an index of type a, while the three gauge bosons in SU (2) transform an index 
of type i into an index of type i. Then there is the U(1) gauge boson that couples 
to the hypercharge +¥ . (Of course, you know what I mean by my somewhat loose 
language: The SU (3) gauge bosons transform fields carrying a color index into a 
field carrying a color index.) 

The fun comes with the gauge bosons Af’ and Ai , which transform the index a 
into the index 7 and vice versa. Since a takes on three values, and 7 takes on two 
values, there are 6 + 6 = 12 such gauge bosons, thus accounting for all the gauge 
bosons in SU (5). In other words, 24 > (8, 1) + (1, 3) + (1, D+ G, 2) + (3%, 2). 


' See, e.g., F. Wilczek, in: V. Fitch et al., eds., Critical Problems in Physics, p. 297. 
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Figure VII.6.2 


We will now see explicitly that the exchange of these bosons between quarks and 
leptons leads to proton decay. 

We merely have to write down the terms in the Lagrangian involving the 
coupling of the bosons A‘ and Ai to fermions and draw the appropriate Feynman 
diagrams. I will go through part of the group theoretic analysis, leaving you to 
work out the rest. Simply by contracting indices we see that the boson A” acting 
on y,, takes it to y, and acting on y”? takes it to y“°. Let us look at what A> 
does, using your result from Exercise (VII.5.1). It takes 


Ws=e > Yy=d (8) 

wer =a Pau (9) 
and 

U4 ada prtaet (10) 


Thus, the exchange of A> generates the process (Figure VII.6.2) u+d—> u+ et, 
leading to proton decay p(uud) — m°(uiz) + e+. Observe that while the decay 
p — m° + et violates both baryon number B and lepton number L, it conserves 
the combination B — L. 

In Exercise VIE.6.3 you will work out the branching ratios for various decay 
modes. Too bad experimentalists have not yet measured them. 


Fermion masses 


We might hope that with grand unification we would gain new understanding of 
quark and lepton masses. Unfortunately, the situation on fermion masses in SU (5) 
is muddled, and to this day nobody understands the origin of quark and lepton 
masses. 
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Introducing a Higgs field gy” transforming as the 5 (as indicated by the notation) 
we can write the coupling 


PCy, (11) 
and 
we CW" En vapa (12) 


(with gy, the conjugate 5*), reflecting the group theoretic fact (see Appendix C) 
that 5* © 10 contains the 5 and 10 @ 10 contains the 5*. 

Since 5 > (3, 1, —3) ® (1, 2, 5) we see that this Higgs field is just the natural 
extension of the SU(2) @ U(1) Higgs doublet (1, 2, 3). Not wanting to break 
electromagnetism, we allow only the electrically neutral fourth component of 
g to acquire a vacuum expectation value. Setting (y+) = v, we obtain (up to 
uninteresting overall constants) 


Uy + usCu™ => my =m, (13) 
and 
pC egg, => m, #0 (14) 


The larger symmetry yields a mass relation mg = m, at the unification scale; 
we again have to apply the renormalization group flow. It is worth noting that the 
mass relation mz =m, comes about because as far as the fermions are concerned, 
SU (5) has been only broken down to SU(4) by g. The trouble is that we obtain 
more or less the same relation for each of the three families, since most of the 
running occurs between the unification scale Mgyy and the top quark mass so 
that threshold effects give only a small correction. Putting in numbers one gets 
something like 

Mp ms Mg 


—~3 (15) 


m, mM, mM, 
Let us use this to predict the down sector quark masses in terms of the lepton 
masses. The formula m, ~ 3m, works rather well and provides indirect evidence 
that there can only be three families since the renormalization group flow depends 
on F. The formula m, ~ 3m,, is more or less in the ballpark, depending on what 
“experimental” value one takes for m,. The formula for m,, on the other hand, is 
downright embarrassing. People mumble something about the first family being 
so light and hence other effects, such as one-loop corrections might be important. 
At the cost of making the theory uglier, people also concoct various schemes by 
introducing more Higgs fields, such as the 45, to give mass to fermions. 
Note that in one respect SU (5) is not as economical as SU (2) ® U (1), in which 
the same Higgs field that gives mass to the gauge bosons also gives mass to the 
fermions. 
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The universe is not empty, but almost 


I mention in passing another triumph of grand unification: its ability to explain 
the origin of matter in our universe. It has long behooved physicists to understand 
two fundamental facts abut the universe: (1) the universe is not empty, and (2) the 
universe is almost empty. To physicists, (1) means that the universe is not symmet- 
ric between matter and antimatter, that is, the net baryon number Nz is nonzero; 
and (2) is quantified by the strikingly small observed value Nz /N,, ~ 10—'° of the 
ratio of the number of baryons to the number of photons. 

Suppose we start with a universe with equal quantities of matter and antimatter. 
For the universe to evolve into the observed matter dominated universe, three 
conditions must be satisfied: (1) The laws of the universe must be asymmetric 
between matter and antimatter. (2) The relevant physical processes had to be out 
of equilibrium so that there was an arrow of time. (3) Baryon number must be 
violated. 

We know for a fact that conditions (1) and (2) indeed hold in the world: There is 
C P violation in the weak interaction and the early universe expanded rapidly. As 
for (3), grand unification naturally violates baryon number. Furthermore, while 
proton decay (suppressed by a factor of 1/ Meur in amplitude) proceeds at an 
agonizingly slow rate (for those involved in the proton decay experiment!), in 
the early universe, when the X and Y bosons are produced in abundance, their 
fast decays could easily drive baryon number violation. The suppression factor 
1/M2 7 does not come in. I have no doubt that eventually the number 10~'° 
measuring “the amount of dirt in the universe” will be calculated in some grand 
unified theory. 


Hierarchy 


I promised you that the Weisskopf phenomenon would come back to haunt us. 
That the grand unification mass scale Mgur naturally comes out so large counts 
as a triumph, but it also leads to a problem known as the hierarchy problem. 
The hierarchy refers to the enormous ratio Mgyr/Mew, where Mpw denotes the 
electroweak unification scale, of order 10? Gev. I will sketch this rather murky 
subject. Look at the Higgs field g responsible for breaking electroweak theory. 
We don’t know its renormalized or physical mass precisely, but we do know 
that it is of order Mpy. Imagine calculating the bare perturbation series in some 
grand unified theory—-the precise theory does not enter into the discussion— 
starting with some bare mass jp for gy. The Weisskopf phenomenon tells us that 
quantum correction shifts Ts by a huge quadratically cutoff dependent amount 
buz~ f?A*~ f?M2..,, where we have substituted for A the only natural mass 
scale around, namely Mgyy, and where f denotes some dimensionless cou- 
pling. To have the physical mass squared yu? = 42 + 542 come out to be of order 
Mey, something like 28 orders of magnitude smaller than Meu would re- 
quire an extremely fine-tuned and highly unnatural cancellation between IT and 
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5y5. How this could happen “naturally” poses a severe challenge to theoretical 
physicists. 


Naturalness 


The hierarchy problem is closely connected with the notion of naturalness dear to 
the theoretical physics community. We naturally expect that dimensionless ratios of 
parameters in our theories should be of order unity, where the phrase “order unity” 
is interpreted liberally between friends, say anywhere from 10~? or 1073 to 10° 
or 10°. Following ’t Hooft, we can formulate a technical definition of naturalness: 
The smallness of a dimensionless parameter 7 would be considered natural only if 
a symmetry emerges in the limit 7 — 0. Thus, fermion masses could be naturally 
small, since, as you will recall from Chapter II.1, a chiral symmetry emerges when 
a fermion mass is set equal to zero. On the other hand, no particular symmetry 
emerges when we set either the bare or renormalized mass of a scalar field equal 
to zero. This represents the essence of the hierarchy problem. 


Exercises 


VIL6.1. Suppose there are F’ new families of quarks and leptons with masses of order 
M’. Adopting the crude approximation described in Exercise VI.8.2 of ignoring 
these families for 4 below M’ and of treating M’ as negligible for 4. above M’, 
run the renormalization group flow and discuss how various predictions, such as 
proton lifetime, are changed. 


VIE.6.2. Work out proton decay in detail. Derive relations between the following decay 
rates: T(p > wet), [(p > xt), P(n > wet), and P(n > 7%). 


VII.6.3. Show that SU (5) conserves the combination B — L. For a challenge, invent a 
grand unified theory that violates B — L. 


Chapter VII.7 
SO(10) Unification 


Each family into a single representation 


At the end of Chapter VII.5 we felt we had good reason to think that SU(5) 
unification is not the end of the story. Let us ask if we might be able to fit the 
5 and 10* into a single representation of a bigger group G containing SU (5). 

It turns out that there is a natural embedding of SU(5) into the orthogonal 
SO(10) that works,! but to explain that I have to teach you some group theory. 
The starting point is perhaps somewhat surprising: We go back to Chapter 1.3, 
where we learned that the Lorentz group $O (3, 1), or its Euclidean cousin $O (4), 
has spinor representations. We will now generalize the concept of spinors to d- 
dimensional Euclidean space. I will work out the details for d even and leave the 
odd dimensions as an exercise for you. You might also want to review Appendix C 
now. 


Clifford algebra and spinor representations 


Start with an assertion. For any integer n we claim that we can find 2n hermitean 
matrices y; (i = 1,2, ---, 2n) that satisfy the Clifford algebra 


{v¥j, vj} = 28; (1) 


In other words, to prove our claim we have to produce 2n hermitean matrices y; 
that anticommute with each other and square to the identity matrix. We will refer 
to the y,’s as the y matrices for SO(2n). 

For n = 1, it is a breeze: y, = t, and y, = T>. There you are. 

Now iterate. Given the 2n y matrices for SO (2n) we construct the (2n + 2) y 
matrices for SO (2n + 2) as follows 


) 


1 Y . 
yr ) _ y;” © T = J (n) ; J = 1, 2, ney on (2) 
0 —¥; 


1 Howard Georgi told me that he actually found 5O(10) before SU (5). 
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1 0 1 
Waid =ten=( :) (3) 


+1 0 -! 
Ying? =1®0= (| >| (4) 


(Throughout this book 1 denotes a unit matrix of the appropriate size.) The super- 
script in parentheses is obviously for us to keep track of which set of y matrices we 
are talking about. Verify that if the y)’s satisfy the Clifford algebra, the y?*)’s 
do as well. For example, 


1p AP = Of? @ my) de) +1@n)- Ven) 
= yy @ {r3, 71} =0 
This iterative construction yields for SO(2n) the y matrices 
Ya-1=1@1@---01O7%O4R8HR®@:-- OF (5) 
and 
Vx =101@---9@19HO9ROB@---@t; (6) 


with 1 appearing k — 1 times and 1, appearing n — k times. The y’s are evidently 
2” by 2” matrices. When and if you feel confused at any point in this discussion 
you should work things out explicitly for SO (4), SO(6), and so on. 

In analogy with the Lorentz group, we define 2n(2n — 1)/2 = n(2n — 1) her- 
mitean matrices 


oi; = sly. y;] (7) 


Note that o;; is equal toi y;y,; for i # 7 and vanishes for i = j. The commutation 
of the o’s with each other is thus easy to work out. For example, 


[o;2, 63) = —[My2. 23] = —NNVY3 + 2LVNY2 = —[yp ¥3] = 21043 


Roughly speaking, the y,’s in 0,7 and o23 knock each other out. Thus, you see 
that the 30;;’s satisfy the same commutation relations as the generators J ’s of 
SO(2n) (as given in Appendix C). The }0;;’s represent the J’/’s. 

As 2” by 2” matrices, the o’s act on an object yw with 2” components that we 
will call the spinor y. Consider the unitary transformation y — e!@/°iy with 
@;; = —@;, a set of real numbers. Then 

vlny > yet eH y, ei y = wing _ ia, w'lo;;, yl +:-: 


for w;; infinitesimal. Using the Clifford algebra we easily evaluate the commutator 
as [o;, pp Val = —2i 6x7; — 5jxy;)- MEK is not equal to either i or j then y;, clearly 


commutes with o;;, and if k is equal to either i or j, then we use ve = 1.) We 
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see that the set of objects v, = ¥'y,W, k = 1,--+-, 2n transforms as a vector in 
2n-dimensional space, with 4c,, the infinitesimal rotation angle in the 77 plane: 


Up UE 2( 040; _ @,U;) = vy _ 40,50; (8) 


(in complete analogy to yy“ transforming as a vector under the Lorentz group.) 
This gives an alternative proof that }o;; represents the generators of SO(2n). 

We define the matrix pF!VE = (—i)" 7 - - » y»,, which in the basis we are using 
has the explicit form 


pre — 7,81,@---@t (9) 


with 7, appearing n times. By analogy with the Lorentz group we define the “left 
handed” spinor ¥,; = 4(1 — yFIVE)y and the “right handed” spinor yp = 4(1+ 
yFIVE)y, such that yIVEy, = —y, and yFIVE, = Wr. Under y —> eu %ys, 
we have yr, —> efi, and yp > e!%U7iyrp since y™YE commutes with o; i 
The projection into left and right handed spinors cut the number of components into 
halves and thus we arrive at the important conclusion that the two irreducible spinor 
representations of SO(2n) have dimension 2”~!. (Convince yourself that the 
representation cannot be reduced further.) In particular, the spinor representation 
of SO(10) is 2!°/2—-! — 24 = 16—dimensional. We will see that the 5* and 10 of 
SU (5) can be fit into the 16 of SO (10). 


Embedding unitary groups into orthogonal groups 


The unitary group SU(5) can be naturally embedded into the orthogonal group 
SO(10). In fact, I will now show you that embedding SU (n) into SO(2n) is as 
easy asz=xX-+1y. 

Consider the 2n-dimensional real vectors x = (x,,°°+,%,, ¥js°°*» Yn) and 
x’ =(x},-°+. x1, y1,°°+» ¥)). By definition, SO(2n) consists of linear transfor- 
mations on these two real vectors leaving their scalar product x’x = I Cres jt 
y; y;) invariant. 

Now out of these two real vectors we can construct two n-dimensional complex 
vectors z= (x; +iy}, +++. X%,_ +iy,) and z’ = (x) +iy;,:--, x) +iy/). The group 
U(n) consists of transformations on the two n-dimensional complex vectors z and 
z’ leaving invariant their scalar product 


n 


Ne / + S\N . 
(z') z= Da; + iy) (x; t+iy,) 
J= 


n n 
_ / / . hoy 
= Dep; + yy) +i 23%; yix;) 
I= I= 
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In other words, 5§O (2n) leaves aX jt yj y,) invariant, but U (1) consists 
of the subset of those transformations in SO(2n) that leave invariant not only 
a1 5% + yiy;) but also a Oi; — yix;)- 

Now that we understand this natural embedding of U(n) into SO(2n), we see 
that the defining or vector representation of SO(2n), which we will call simply 
2n, decomposes upon restriction to U (n) into the two defining representations of 
U(n), n and n*; thus 


2n +n @n* (10) 


In other words, (X},°+-,%_5 Yi. °° *» Yq) can be written as (x; + iy], °° +X, +1Yp) 
and (x, — iy}, °+*,X, —iy,) Note that this is the analog of (VII.5.4) indicating 
that the defining representation of SU(5) decomposes into representations of 
SUG) @ SU(2) @ UC): 


5 -> (3*, 1, 4) @ (1, 2, —4). (11) 


Given the decomposition law (10), we can now figure out how other repre- 
sentations of SO(2n) decompose when restricted to the natural subgroup U (n). 
The tensor representations of SO (2n) are easy, since they are constructed out of 
the vector representation. [This is precisely what we did in going from (VII.5.4) 
to (VIL.5.7, 8, and 9).] For example, the adjoint representation of §O(2n), which 
has dimension 2” (2n — 1) /2 =n(2n — 1), transforms as an antisymmetric 2-index 
tensor 2n @, 2n and so decomposes into 


2n @, 2n-> (n@n*) @, n@n*) (12) 


according to (10). The antisymmetric product ®, on the right hand side is, of 
course, to be evaluated within U/(n). For instance, n ®, n is the n(m — 1)/2 
representation of U (n). In this way, we see that 


n(2n — 1) ~> n* — 1 (the adjoint) 
® | (the singlet) 
@®n(n — 1) /2 
® (n(n — 1)/2)" 


As acheck, the total dimension of the representations of U (7) on the right hand side 
adds up to (n? — 1) + 1+ 2n(n — 1)/2 =n(2n — 1). In particular, for SO(10) D 
SU (5), we have 45 > 24 61 10 @ 10* and of course 24+ 1+ 10+ 10= 45. 


(13) 


Decomposing the spinor 


It is more difficult to figure out how the spinor representation of SO (2n) decom- 
pose upon restriction to U (n). I give here a heuristic argument that satisfies most 
physicists, but certainly not mathematicians. I will just do SO(10) > SU (5) and 
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let you work out the general case. The question is how the 16 falls apart. Just from 
numerology and from knowing the dimensions of the smaller representations of 
SU (5) (1, 5, 10, 15) we see there are only so many possibilities, some of them 
rather unlikely, for example, the 16 falling apart into 16 1’s. 

Picture the spinor 16 of SO(10) breaking up into a bunch of representations of 
SU (5). By definition, the 45 generators of $0 (10) scramble all these representa- 
tions together. Let us ask what the various pieces of 45, namely 24 61 10 @ 10*, 
do to these representations. 

The 24 transform each of the representations of SU (5) into itself, of course, 
because they are the 24 generators of SU(5) and that is what generators do in 
life. The generator 1 can only multiply each of these representations by a real 
number. (In other words, the corresponding group element multiplies each of the 
representations by a phase factor.) 

What does the 10, which as you recali from Chapter VII.5 is represented as 
an antisymmetric tensor with two upper indices and hence also known as [2], do 
to these representations? Suppose the bunch of representations that S breaks up 
into contains the singlet (0}= 1 of SU(5). The 10 = [2] acting on [0] gives the 
{2] = 10. (Almost too obvious for words! An antisymmetric tensor of two indices 
combined with a tensor with no indices is an antisymmetric tensor of two indices.) 
What about 10 = [2] acting on [2]? The result is a tensor with four upper indices. It 
certainly contains the [4], which is equivalent to [1}* = 5*. But look, 1@ 10 @ 5* 
already adds up to 16. Thus, we have accounted for everybody. There can’t be 
more. So we conclude 


St > [0] @ [2] [4J=16 1005" (14) 


The 5* and the 10 of SU(5) fit inside the 167 of SO(10)! 

We will learn later that the two spinor representations of SO(10) are conjugate 
to each other. Indeed, you may have noticed that I snuck a superscript plus on the 
letter §. The conjugate spinor S~ breaks up into the conjugate of the representations 
in (14): 


S~ — [1] @ [3] @ [5]=5 @ 10* @ I* (15) 


The long lost antineutrino 


The fit would be perfect if we introduce one more field transforming as a 1, that is, a 
singlet under SU (5) and hence a fortiori a singlet under SU (3) @ SU(2) @ U(1). In 
other words, this field does not participate in the strong, weak, and electromagnetic 
interactions, or in plain English, it describes a lepton with no electric charge and 
is not involved in the known weak interaction. Thus, this field can be identified as 
the “long lost” antineutrino field vf . This guy does not listen to any of the known 
gauge bosons. 

Recall that we are using a convention in which all fermion fields are left handed, 
and hence we have written v’ . By a conjugate transformation, as explained earlier, 
this is equivalent to the right handed neutrino field vp. 


410 VII. Grand Unification 


Since vp is an SU(5) singlet, we can give it a Majorana mass M without 
breaking SU(5). Hence we expect M to be larger than or of the same order 
of magnitude as the mass scale at which SU(5) is broken, which as we saw 
in Chapter VIL5 is much higher than the mass scales that have been explored 
experimentally. This explains why vp has not been seen. 

On the other hand, with the presence of vp we can have a Dirac mass term 
m(VrVp + h.c.). Since this term breaks SU (2) @ U(1) just like the mass terms 
for the quarks and leptons we know, we expect m to be of the same order of 
magnitude? as the known quark and lepton masses (which for reasons unknown 
span an enormous range). 

Thus, in the space spanned by (v, v°) we have the (Majorana) mass matrix 


O m 
n= () mr) (16) 


with M >> m. Since the trace and determinant of M are M and —m?, respectively, 
M has a large eigenvalue ~ M and a small eigenvalue ~ m?/M. A tiny mass 
~ m?/M, suppressed relative to the usual quark and lepton masses by the factor 
m/M, is naturally generated for the (observed) left handed neutrino. This rather 
attractive scenario, known as the seesaw mechanism for obvious reason, was 
discovered independently by Yanagida and by Gell-Mann, Ramond, and Slansky. 

Again, the tight fit of the 5* and the 10 of SU (5) inside the 16* of SO(10) has 
convinced many physicists that it is surely right. 


A binary code for the world 


Given the direct product form of the y matrices in (5) and (6), and hence of o; fr 
we can write the states of the spinor representations as 


|e 1&2 ste En) (17) 
where each of the «’s takes on the values +1. From (9) we see that 
priv lee, +--+ &,) = (IV 18;) [ej€2 +--+ &,) (18) 


The right handed spinor S* consists of those states |€,£2 -- + €,,) with (I 1&)) = 
+1, and the left handed spinor S~ those states with (IV 1€ p> = —1. Indeed, the 


spinor representations have dimension 2” 1. 


* Explicitly, with vz now available we can add to the SU (2) @ U(1) theory of Chapter 
VII.2 the term f’Gi,W,, where ¢ = tg". In the absence of any indication to the contrary, 
we might suppose that /’ is of the same order of magntiude as the coupling f that leads to 
the electron mass. 
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Thus, in $O(10) unification the fundamental quarks and leptons are described 
by a five-bit binary code, with states such as | + +—-— +) and |—+-——-—-). 
Personally, I find this a rather pleasing picture of the world. 

Let us work out the states explicitly. This also gives me a chance to make sure 
that you understand the group theory presented in this chapter. Start with the much 
simpler case of SO(4). The spinor S* consists of | + +) and | — —) while the 
spinor S~ consists of | + —) and | — +). As discussed in Chapter II.3, SO(4) 
contains two distinct SU(2) subgroups. Removing a few factors of i from the 
discussion in Chapter II.3 we see that the third generator of SU (2), call it o3, can 
be taken to be either 07 — 034 of 0}2 +034. The two choices correspond to the two 
distinct SU (2) subgroups. We choose (arbitrarily) 03 = 4(012 — 034). From (5) and 
(6) we have o)7 = iy; y2 = 1(T, @ 13) (tT, @ 13) = — 73 @ Land 034 = —1@ 13, and 
so 03 = 4(—T; @ 1+ 1 @ 73). To figure out how the four states | + +), | — —), 
| -+ —), and | — +) transform under our chosen SU (2), let us act on them with 03. 
For example, 


o3|++) = 35-4 @14+18t)|++)= 51+ )| ++) =0 
and 
o3/—+)=3(-43 @141@r)|-+)=F04+ D|-+ =|-4) 


Aha, under SU(2) | + +) and | — —) are two singlets while | + —) and | — +) 
make up a doublet. 

Note that this is consistent with the generalization of (14) and (15), namely that 
upon the restriction of SO(2n) to U(n) the spinors decompose as 


S* > [0]0 [2]@--- (19) 
and 
S >-[lleBle--- (20) 


[ have not indicated the end of the two sequences: A moment’s reflection indicates 
that it depends on whether n is even or odd. In our example, n = 2,and thus 
2+ —» [0] @ [2] =1@ Land2~ > [1]=2. Similarly, form = 3, upon the restriction 
of SO(6) to UGB), 47 ~ [0] 6 2] =16 3* and 4 — [1106 BJ] =3 61. (Our 
choice of which triplet representation of U (3) to call 3 or 3* is made to conform 
to common usage, as we will see presently.) 

We are now ready to figure out the identity of each of the 16 states such 
as | ++—-—-+) in SO(1O) unification. First of all, (18) tells us that under 
the subgroup SO(4) @ SO(6) of SO(10) the spinor 16* decomposes as (since 
T_)¢; = +1 implies €,&) = &3&4€5) 

167 > (227,45) @(2°,4) (21) 


We identify the natural SU (2) subgroup of SO (4) as the SU (2) of the electroweak 
interaction and the natural SU (3) subgroup of SO(6) as the color SU (3) of the 
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strong interaction. Thus, according to the preceding discussion, (2+, 4*) are the 
SU (2) singlets of the standard U(1) ® SU (2) ® SU (3) model, while (2—, 47) are 
the $U (2) doublets. Here is the lineup (all fields being left handed as usual): 


SU (2) doublets: 
v=|-+-~-) 
e =|+—---——) 
u=|—+++-—-),|-++-—+), and |—+—++4+) 


d=|+~++ -), |+-+-+), and |+~-++) 


SU (2) singlets: 
vo=|/+++++) 


et=|—~+++) 
uo=|+++—-—), |++—-+-), and |++-——-+4) 
d°=|—-—+—-—-—),|--—+-—), and |————+). 


I assure you that this is a lot of fun to work out and I urge you to reconstruct 
this table without looking at it. Here are a few hints if you need help. From our 
discussion of SU (2) I know that v = | — +63€485) and e~ = | + —&3€465), but 
how do I know that 6, = e, = e, = —1? First, I know that ¢3e,¢5 = —1. I also 
know that 4~— + 3@ 1 upon restricting $O(6) to color SU (3). Well, of the four 
states |—-——), [++-—), ]|+-—+), and | —++) the “odd man out” is 
clearly | — — — ). By the same heuristic argument, among the 16 possible states 
|++++++) is the “odd man out” and so must be v°. 

There are lots of consistency checks. For example, once I identify v = | — + — 
—-—),e =|+-—-—-—), andv’= {(+++-++),I can figure out the electric 
charge Q, which, since it transforms as a singlet under color SU (3), must have 
the value Q = ae, + be, + c(€3 + &4 + €5) when acting on the state |€)62£3€485). 
The constants a, b, and c can be determined from the three equations Q(v) = 
—a+b—3c=0, O(e-) =0, and O(v") =0. Thus, OQ = — 51 + £(€3+ €4 +65). 

Living in the computer age, I find it intriguing that the fundamental constituents 
of matter are coded by five bits. You can tell your condensed matter colleagues 
that their beloved electron is composed of the binary strings + — — — — and 
——+-+-+. An intriguing possibility’ suggests itself, that quarks and leptons 
may be composed of five different species of fundamental fermionic objects. We 
construct composites, writing a + if that species is present, and a — if it is absent. 
For example, from the expression for QO given above, we see that species 1 carries 
electric charge —4, species 2 is neutral, and species 3, 4, and 5 carry charge 4. A 
more or less concrete model can even be imagined by binding these fundamental 
fermionic objects to a magnetic monopole. 


3 or further details, see F. Wilczek and A. Zee, Phys. Rev. D25: 553, 1982, Section IV. 
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A speculation on the origin of families 


One of the great unsolved puzzles in particle physics is the family problem. Why 
do quarks and leptons come in three generations {v,, e, u, d}, {v,,, 4, c, s}, and 
{v,, v, t, b}? The way we incorporate this experimental fact into our present day 
theory can only be described as pathetic: We repeat the fermionic sector of the 
Lagrangian three times without any understanding whatsoever. Three generations 
living together gives rise to a nagging family problem. 

Our binary code view of the world suggests a wildly speculative (perhaps too 
speculative to mention in a textbook?) approach to the family problem: We add 
more bits. To me, areasonable possibility is to “hyperunify” into an SO (18) theory, 
putting all fermions into a single spinorial representation S+ = 2567, which upon 
the breaking of SO(18) to SO(10) @ SO(8) decomposes as 


256+ + (167, 87) @ (16°, 8°) (22) 


We have a lot of 16*’s. Unhappily, we see that group theory [see also (21)] 
dictates that we also get a bunch of unwanted 16~’s. One suggestion is that Nature 
might repeat the trick She uses with color SU(3), whose strong force confines 
fields that are not color singlets (Chapter VII.3). Interestingly, we can exploit a 
striking feature of SO(8), which some people regard as the most beautiful of ail 
groups. In particular, the two spinorial representations 8* have the same dimension 
as the vectorial representation 8” (the equation 2”—! = 2n has the unique solution 
n = 4). There is a transformation that cyclically rotates these three representations 
8+, 8, and 8” into each other (in the jargon, the group 5O(8) admits an outer 
automorphism). Thus, there exists a subgroup SO(5) of SO(8) such that when we 
break 5O(8) into that SO(5) 8* behaves like 8” while 8~ behaves like a spinor, 
namely 


8t+>5016101 and 8 > 404 (23) 


If we call this* 5O(5) hypercolor and assumes that the strong force associated with 
it confines ail fields that are not hypercolor singlets, then only three 16*’s remain! 
Unfortunately, as the relevant physics occurs in the energy regime above grand 
unification, our knowledge of the dynamics of symmetry breaking is far too paltry 
for us to make any further statements. 


Charge conjugation 


The direct product ® notation we use here allows us to construct the conjugation 
matrix C explicitly. By definition C TokC = —a;; (so that C changes ej into 


4 The reader savvy with group theory would recognize that SO(5) is isomorphic with 
the symplectic group Sp(4) and that the Dynkin diagram of 50 (8) is the most symmetric 
of all. 
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its complex conjugate.) From (2), (3), and (4) we see that we can construct 


n+] C%@r if nodd 
c =lom en if neven (24) 
You can check that this gives C~!y ie = (—1)"y,; and hence the desired result. 
Explicitly, C is a direct product of an alternating sequence of t, and tr, and 
so we deduce an important property. Acting on [££ ---&,), C flips the sign of 
all the e’s. Thus C changes the sign of (77 _1€ j) for n odd, and does not for n 
even. For n odd, the two spinor representations S* and S~ are conjugate of each 
other, while for n even, they are conjugate of themselves, or in other words, they 
are real. This can also be seen directly from C~!yFIVEC = (—1)"y PVE. You can 
check this with all the explicit examples we have encountered: SO(2), SO(4), 
SO(6), SO), SO(10), and SO (18). See also exercise (3). 


Anomalies 


What about anomalies in §O(2n) grand unification? According to the discussion 
in Chapter VII.5 we have to evaluate AY!" = tr(yJ (7, 7’")}) over the fermion 
representation. Applying an SO (2n) transformation J‘! — O7 J'/ O we see easily 
that A‘/#/m7 is an invariant tensor. Can we constructan invariant 6-index tensor with 
the appropriate symmetry properties (e.g., A¥/k"™™ = — Asikimny in $O(2n)? We 
can’t, except in §O(6), for which we have e/J*""_ Thus, A‘/*"" vanishes except 
in SO(6), where it is proportional to e'/*"". An elegant one line proof that any 
grand unified theory based on SO(2n) for n ¥ 3 is free from anomaly! 

The cancellation of the anomaly between 5* and 10 at the end of Chapter VII.5 
doesn’t seem so miraculous any more. Miracles tend to fade away as we gain deeper 
understanding. 

Amusingly, by discussing a physics question, namely whether a gauge theory is 
renormalizable or not, we have discovered a mathematical fact. What is so special 
about §O(6)? See Exercise VII.7.5. 


Exercises 
VIL.7.1. Work out the Clifford algebra in d-dimensional space for d odd. 
ViIL.7.2. Work out the Clifford algebra in d-dimensional Minkowski space. 
VIIL.7.3. Show that the Clifford algebra for d = 4k and for d = 4k + 2 have somewhat 
different properties. (If you need help with this and the two preceding exercises, 
look up F. Wilczek and A. Zee, Phys. Rev. D25: 553, 1982.) 


VII.74. Discuss the Higgs sector of the SO(10). What do you need to give mass to the 
quarks and leptons? 


VIL7.5. 


VIL7.6. 


VIL7.7. 
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The group S$O(6) has 6(6 — 1)/2 = 15 generators. Notice that the group SU (4) 
also has 47 — 1 = 15 generators. Substantiate your suspicion that 5O(6) and 
SU (4) are isomorphic. Identify some low dimensional representations. 


Show that (unfortunately) the number of families we get in SO(18) depends on 
which subgroup of 5O(8) we take to be hypercolor. 


If you want to grow up to be a string theorist, you need to be familiar with the 
Dirac equation in various dimensions but especially in 10. As a warm up, study 
the Dirac equation in 2-dimensional spacetime. Then proceed to study the Dirac 
equation in 10-dimensional spacetime. 


PART VIII 
GRAVITY AND BEYOND 


Chapter VIII.1 


Gravity as a Field Theory and 
the Kaluza-Klein Picture 


Including gravity 


Field theory texts written a generation ago typically do not even mention gravity. 
The gravitational interaction, being so much weaker than the other three interac- 
tions, was simply not included in the education of particle physicists. The situation 
has changed with a vengeance: The main drive of theoretical high energy phys- 
ics today is the unification of gravity with the other three interactions, with string 
theory the main candidate for a unified theory. 

From a course on general relativity you would have learned about the Einstein- 
Hilbert action for gravity 


4 4. (To? 
=, | 4 x./—gR= fa x/—gM5R (1) 


where g = det g,,, denotes the determinant of the curved metric g,,, of spacetime, 
R is the scalar curvature, and G is Newton’s constant. Let me remind you that the 
Riemann curvature tensor 


R* = OTe, — a0, +070, — re (2) 


VK KO vo vo Ko 


is constructed out of the Riemann-Christoffel symbol (recall Chapter I-10): 


ry = 58 (Ov8ou + ,8pv _ IS uv) (3) 


The Ricci tensor is defined by Rux = R),, and the scalar curvature by R = 


LVK 


g"*R.,,. Varying S gives us! the Einstein field equation 


Ruy — 38 yvR = —81GT,y (4) 


pv 


The Einstein-Hilbert action is uniquely determined if we require the action to 
be coordinate invariant and to involve two powers of spacetime derivative. As you 


1 See, e.g., 5. Weinberg, Gravitation and Cosmology, p. 364. 
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can see from (2) and (3) the scalar curvature R involves two powers of derivative 
and the dimensionless field g,,, and thus has mass dimension 2. Hence G—! must 
have mass dimgnsion 2. The second form in (1) emphasizes this point and is often 
preferred in modern work on gravity. (The modified Planck mass Mp = 1/./ 167G 
differs from the usual Planck mass by a trivial factor, much like the relation between 
h and h.) 

The theory sprang from Einstein’s profound intuition regarding the curvature of 
Spacetime and is manifestly formulated in terms of geometric concepts. In many 
textbooks, Einstein’s theory is developed, and rightly so, in purely geometric terms. 

On the other hand, as I hinted back in chapter I.6, gravity can be treated on the 
same footing as the other interactions. After all, the graviton may be regarded as 
just another elementary particle like the photon. The action (1), however, does not 
look anything like the field theories we have studied thus far. I will now show you 
that in fact it does have the same kind of structure. 


Gravity as a field theory 


Let us write g,., =) +h,» where 7,,, denotes the flat Minkowski metric and 
hy the deviation from the flat metric. Expand the action in powers of h,,,,. In order 
not to drown in a sea of Lorentz indices, let us suppress them for a first go-around. 
Merely from the fact that the scalar curvature R involves two derivatives 2 in its 
definition, we see that the expansion must have the schematic form 


s= | ats : (dhdh + hahah + hahah + ---) (5) 
16zG 


after dropping total divergences. As I remarked in Chapter I.10, the field h#” (x) 
describes a graviton in flat space and is to be treated like any other field. The 
first term dhdh, which governs how the graviton propagates, is conceptually no 
different than the first term in the action for a scalar field 8g¢4¢@ or for the photon 
field @A@A. The terms cubic and higher in A determine the interaction of the 
graviton with itself. 

The Einstein-Hilbert action in the weak field expansion is structurally rem- 
iniscent of the Yang-Mills action, which may be written in schematic form as 
S = f d*x(1/g?)(@AGA + A70A + A’). As I explained in Chapter IV.5, we un- 
derstand the self interaction of the Yang-Mills bosons physically: The bosons 
themselves carry the charge to which they couple. We can understand the self 
interaction of the graviton similarly: The graviton couples to anything carrying 
energy and momentum, and it certainly carries energy and momentum itself. In 
contrast, the photon does not couple to itself. 

We say that Yang-Mills and Einstein theories are nonlinear, while Maxwell 
theory is linear. The former are hard, the latter easy. 

But while the Yang-Mills action terminates, the Einstein-Hilbert action, because 
of the presence of ./—g and of the inverse of g yy iS an infinite series in the graviton 
field h,,,. 
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eae 


(a) (b) 


Figure VIII.1.1 


The other major difference is that while Yang-Mills theory is renormalizable, 
gravity is notoriously nonrenormalizable, as we argued by dimensional analysis 
in Chapter TII.2. We are now in a position to see this explicitly. Consider the 
self energy correction to the graviton propagator shown in Figure VIII.1.la. We 
see from the second term in (5) that the three-graviton coupling involves two 
powers of momentum. Thus the Feynman integral goes as { d*k(kkkk /k*k?), 
with four powers of k in the numerator from the two vertices and four powers in the 
denominator from the propagators. Taking out two powers of momentum to extract 
the coefficient of dhdh, we see that the correction to 1/ G is quadratically divergent. 
Because of the explicit powers of momentum in the coupling, the divergence gets 
wotse and worse as we go to higher and higher order. Compare Figure VIII.1.1b to 
1a: We have three more propagators, worth ~ 1/k®, and one more loop integration 
f d+k, but two more vertices ~ k*. The degree of divergence goes up by 2. Of 
course we already knew all this by dimensional analysis. 

As mentioned in Chapter I.10 the fundamental definition 


p(y = -—2__ 58 


/-& 68 yy(x) 


tells us that coupling of the graviton to matter (in the weak field limit) can be 
included by adding the term 


= | d*x5h,yT"” (6) 


to the action, where 7“” stands for the (flat spacetime) stress-energy tensor of all 
the matter fields of the world, a matter field being any field that is not the graviton 
field. Thus, with the inclusion of matter (5) is modified schematically” to 


1 

S= | d*x[ (Qhdh + hahah + h?ahah+---)+ (AT +--)] (7) 
16xG 

In Chapter IV.5 I noted that we can bring Yang-Mills theory into the same 

convention commonly used in Maxwell theory by a trivial rescaling A > gA. 


“Tf this is to represent an expansion of S in powers of h, then strictly speaking, if 
we display the terms cubic and quartic in the Einstein-Hilbert action, we should also 
display the contribution coming from the terms of higher order in h contained in T"” (x) = 
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Similarly, we can also bring Einstein theory into the same convention by rescaling 
the graviton field h#” + ./Gh*” so that the action becomes 


S= / d*x (ahah + /Ghahah + Gh?ahdh +---+/GhT) 


We see explicitly that /167G = 1/Mp measures the strength of the graviton 
coupling to itself and to all other fields. Once again, the enormity of M p (compared 
to the scale of the strong interaction, say) indicates the feebleness of gravity. 

Here we expanded g,,,, atound a flat metric but we could just as well expand 
Suv = 8yv +Ayy, With g,,, acurved metric, that of a black hole (see Chapter V.7) 
for instance. 


Determining the weak field action 


After this index free survey we are ready to tackle the indices. We would like to 
determine the first term d/dh in (7) so that we can obtain the graviton propagator. 
Thus, we have to expand the action § = M 2 f d*x./—gg""R,,, up toand including 
order A”. From (2) and (3) we see that the Ricci tensor R uy Starts in O(h) so that 
it suffices to evaluate ./—gg"” to O(h). That’s easy: As we have already seen in 
Chapter 1.10, g = —[1 + n¥ A, + O(h*)] and g#” = nH” — h#” + O(h?) so that 
J/—egh! = nh? —hYY + Ln’ + O(h?), where we have definedh = nh). We 
now must calculate R,,,, to O(h *) | a straightforward but tedious task starting from 
(2) and (3). 

In line with the spirit of this book, which is to avoid tedious calculation when- 
ever possible, I will now show you how to get around this. We invoke symmetry 
considerations! 

Under a general coordinate transformation x# — x'# = x" — s(x) the metric 
changes to g’#” = (dx’# /ax?)(dx’” /dx")g°". Plugging in g#” = n#Y —hAhY+..., 
lowering the indices (with 7,,,, to this order), and using (0x""/0x°) = 64 — d,e", 
we find, treating 0,,¢, as of the same order as h#": 


hi = yy + 8,6) + de, (8) 


Note the structural similarity to the electromagnetic gauge transformation A), = 
A, — 9,,A. Very nice! We will explore the sense in which gravity can be regarded 
as a gauge theory in more detail later. 

We are looking for the terms in the action quadratic in # and quadratic in d@. 
Lorentz invariance tells us that there are four possible terms (To see this, first 
write down terms with the indices on the two @ matching, then the terms with the 
index on a d matching an index on an A, and so on): 


S= J d*x(ad,h*’a*h,y + ba, nia*hy +cd,h’’d"h,, + dhya"arh,,) 


with four unknown constants a, b, c, and d. Now vary S with dh,,,, = 0,6, + ,€,, 
integrating by parts freely. For example, 
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5(O,hY Yd" hy) = 218, (20"e")(O*h yy) = "400" d"h,., 


Since there are three objects linear in A, linear in €, and cubic in 0 (namely 
e’97d,h and €"4,0*d"h, » in addition to the one already shown) the condition 
dS = 0 gives three equations, just enough to fix the action up to an overall constant, 
cotresponding to Newton’s constant. The invariant combination turns out to be 


J= 40h Ohi, ~ 30, hia" h’ ~ ah a" hy, + 0 hyd* Ayy 9) 


Thus, even if we had never heard of the Einstein-Hilbert action we could still 
determine the action for gravity in the weak field limit by requiring that the action 
be invariant under the transformation (8). This is hardly surprising since coordinate 
invariance determines the Einstein-Hilbert action. Still, it is nice to construct 
gravity “from scratch.” 

Referring to (6), we can now write the weak field expansion of § as 


1 1 
Swe = } d4 J ——h,, TH” 
fe | a Gere 2H ) 


without having to expand R to O(h7). The coefficient of J is fixed by the require- 
ment that we reproduce the usual Newtonian gravity (see later). 


The graviton propagator 


As we anticipated in (5) the action S,, indeed has the same quadratic structure 
of all the field theories we have studied, and so as usual the graviton propagator is 
just the inverse of a differential operator. But just as in Maxwell and Yang-Mills 
theories the relevant differential operator in Einstein-Hilbert theory does not have 
an inverse because of the “gauge invariance” in (8). 

No problem. We have already developed the Faddeev-Popov method to deal 
with this difficulty. In fact, for my limited purposes here, to derive the graviton 
propagator in flat spacetime, I don’t even need the full-blown Faddeev-Popov 
formalism with ghosts and all.’ Indeed, recall from Chapter [11.4 that for the 
Feynman gauge (€ = 1) we simply add (@A)? to the invariant 


SF! FE, = 0" A'(a,A, — 8,A,)“ =" — Any ,d7A” — (8A)? 


thus canceling the last term. Inverting the differential operator —n vo we obtain 
the photon propagator in the Feynman gauge —i7,,,,/ k?. We play the same “trick” 
for gravity. After staring at 


J= 3d,A"YA"h,, — 30,Aa*hY — Oh aM Ay, + OALIM Ay, 


? This is because (8) does not involve the field h,,,, just as in the Maxwell case but unlike 
the Yang-Mills case. Since we do not intend to calculate loop diagrams in quantum gravity, 
we do not need the full power of the Faddeev-Popov method. 
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for a while, we see that by adding (0h, — Larpry? we can knock off the last two 
terms in J so that Sy, effectively becomes 


1f 1 1 
Suto = | d’x- ah” a*h,,, — -8 ho) —h re 10 
fg | > Ee ( A pv > a mv (10) 


In other words, the freedom in choosing A ,,, in (8) allows us to impose the so-called 
harmonic gauge condition 


a,,h% = 58,h> (11) 


(the linearized version of ,,(,/—gg"”) = 0.) 
Writing (10) in the form 


S= =.5 d4x [hY” Kyi (82) + oun')| 


we see that we have to invert the matrix 


Kuro = 3 (uate + MyotMa — Nvtro) 


regarding jzv and Aq as the two indices. Note that we have to maintain the sym- 
metry of h#*. In other words, we are dealing with matrices acting in a linear space 
spanned by symmetric two-index tensors. Thus, the identity matrix is actually 


Livac = 5 (uate + No Nr) 


You can check that Kyya9K40, = Iyv;pa 80 that K~! = K. Thus, in the harmonic 
gauge the graviton propagator in flat spacetime is given by (scaling out Newton’s 


constant) 


Lyte + Nuova ~ Nuvo 


12 
2 k2 +i¢ (12) 


Diy ra (k) = 


Newton from Einstein 


Varying (10) with respect to h ,,,, we obtain the Euler-Lagrange equation of motion* 
ang (—287h,,, + ,,,97h) — T,,y =0. Taking the trace, we find 87h = 161GT 
(with T = 7n,,,T“") and so we obtain? 


Ph, = —162G(T,,y — $n T) (13) 


4 Note that the flat spacetime energy momentum conservation 0“ T,,y = 0 together with 
the equation of motion implies 7(0“h,,,, — 43,h) = 0. 

> Thus, the Einstein equation in vacuum R,,, = 0 reduces to 37h,,,, = 0; hence the name 
“harmonic.” 
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In the static limit, Tyg is the dominant component® of the stress-energy tensor and 
(13) reduces to Vo = 47 GTp, upon recalling from Chapter I.5 that the Newtonian 
gravitational potential @ = 4/o9. We have just derived Poisson’s equation for @. 

Incidentally, this suggests another way of avoiding the tedious task of expanding 
the Einstein-Hilbert action (and hence R) to O(h”) if you are willing to accept the 
Einstein field equation (4) as given. You need expand K,,,, only to O(h) to obtain 
(13) from (4), and from (13) you can reconstruct the action to O (h?). Indeed, from 
(2) and (3) you easily get 


Ruy = 5 (8 Ayy + 3,9,8) + 8,8,h% — 3,885) + O(h?) > — 39° hyy + OH”) 


with the further simplification in harmonic gauge. But this is not quite fair since 
considerable technolo gy! (Palatini identity and all the rest) is needed to derive (4) 
from (1). 


Einstein’s theory and the deflection of light 


Consider two particles with stress-energy tensors Ti) and Ty) respectively inter- 
acting via the exchange of a graviton. The scattering amplitude is then (up to some 
overall constant not essential for our purposes here) given by 

ho 


GT) Puro Toy = 57 


v 
(271) Ty uv — Ta Tay) 
For nonrelativistic matter T™ is much larger than the other components T°/ and 
T’! (as I have just remarked), so the scattering amplitude between two lumps of 
nonrelativistic matter (say, the earth and you) is proportional to 
G 


007-00 _ 7-00-00, _ © 00,700 
27 1a — Tay Fay) = ya 27a) 


G 
242 


As explained way back in Chapters I.4 and 15, the interaction potential is given 
by the Fourier transform of the scattering amplitude, namely 


off dda’ TOM (xy TOM f de E) — 
k2 


and thus for two well-separated objects we recover the Newtonian potential 
GMM o)/r. 

We are now able to address the issue raised at the end of Chapter I.5. Suppose a 
particle theorist, Dr. Gravity, wants to propose a theory of gravity to rival Einstein’s 
theory. Dr. G claims that gravity is due to the exchange of a spin 2 particle with a 
teeny mass m, coupled to the stress-energy tensor 7”. In Chapter I.5 we worked 


© Note that, in contrast to Tg, Aog does not dominate the other components of h,,y. 
7 See S. Weinberg, Gravitation and Cosmology, pp. 290 and 364. 
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out the propagator of a massive spin 2 particle, namely 


2 

Den (k) = a) 3 (Gyr Gyo + Gyo Gy — $ GyuvGyo)/(k? ~ mz. + 1é) 
with Gy = Muy — kyky/ m2, (after a trivial notational adjustment). Since the parti- 
cle is coupled to a conserved source k,,T“" = 0 we can replace G,,, by 7,,,. Thus, 
in the limit mg — 0 we have the propagator 


1 ha + Quota — Nuvlac 


pin2 
k 
= k* + ie 


Div, Ag (14) 
Compare this with (12). Dr. G’s propagator differs from Einstein’s: 3 versus 1. 
Remarkably, gravity is not generated by an almost massless spin 2 particle. The 
“2 discontinuity” between (12) and (14) was discovered in 1970 independently by 
Iwasaki, by van Dam and Veltman, and by Zakharov. 

In Dr. G’s theory (with his own gravitational coupling Gg), the interaction 
between two particles is given by 


Gg T(y D Duy Ag (k) Ty = 


2 
(1) Q) = 3,2 a5 ATS) Tayuv — 3 foley) 


For two lumps of nonrelativistic matter this becomes 


2 4G 
5 me 5a ATT a) — 320) T(3)) = 3 3 TT) 
Dr. G simply takes his Gg = 3G and his theory passes all experimental tests. 

But wait! There is also the famous 1919 observation of the deflection of starlight 
by the sun, and the photon is definitely not a lump of nonrelativistic matter. Indeed, 
recall from Chapter I.10 (or from your course on electromagnetism) that T = Tr 
vanishes for the photon. Thus, taking Thy and Ti ) to be the stress-energy tensor 
of the sun and of the photon respectively, Binstein would have for the , Scattering 
amplitude (G/2k?)2T* (1) ” T(2) av while Dr. G would have (G¢/2k7)2T" Ti) *T 2) yv = 
2(G/ 2k*)2Th) T,2),»- Dr. G would have predicted a deflection angle of 3GM/R 
instead of 4G. M/R (with M and R the mass and radius of the sun). On the Brazilian 
island of Sobral in 1919 Einstein triumphed over Dr. G. 

As explained in Chapter 1.5, while a massive spin 2 particle has 5 degrees of 
freedom the massless graviton has only 2. (I give an analysis of the helicity +2 
structure of one graviton exchange in Appendix 2.) The 5 degrees of freedom may 
be thought of as consisting of the helicity +2 degrees of freedom we want plus 
2 helicity +1 and a helicity 0 degrees of freedom. The coupling of the helicity 
+1 degrees of freedom vanishes because k,T"* = 0. Thus, effectively, we are left 
with an extra scalar coupling to the trace T = T“” of the stress-energy tensor; as 
we can see plainly the discrepancy indeed resides in the last term of (12) and (14). 

You should be disturbed that a measurement of the deflection of starlight can 
show that a physical quantity, the graviton mass mg, is mathematically zero rather 
than less than some extremely small value. This apparent paradox was resolved by 
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A. Vainshtein in 1972.5 He found that Dr. G’s theory contains a distance scale 


(omy 
Y m' 


in the gravitational field around a body of mass M. The helicity 0 degree of freedom 
becomes effective only on the distance scales r >> ry. Inside the Vainshtein radius 
ry, the gravitational field is the same as in Einstein’s theory and experiments cannot 
distinguish between Einstein’s and Dr. G’s theories. With the current astrophysical 
bound mg; « (1074 cm)~! and M the mass of the sun, ry comes out to be much 
larger than the size of the solar system. In other words, the apparent paradox arose 
because of an interchange of limits: We can take either the characteristic distance 
of the measurement ro, (the radius of the sun in the deflection of starlight) or the 
Vainshtein radius ry to infinity first. 

So all is well: Dr. G’s theory is consistent with current measurements provided 
that he takes mg small enough. What he is not allowed to do is use the one 
graviton exchange approximation. Instead, he should solve the massive analog of 
Hinstein’s field equation (4) around a massive body such as the sun, as Vainshtein 
did in 1972. This is equivalent to expanding to all orders in the graviton field 
h and resumming: In Feynman diagram language we have an infinite number of 
diagrams corresponding to the sun emitting 1, 2, 3, - - - , oo gravitons respectively. 
The paradox is formally resolved by noting that the higher orders are increasingly 
singular as mp — 0. 


The gravity of light 


At this point, you are ready to do perturbative quantum gravity: You have the 
graviton propagator (12), and you can read off the interaction between gravitons 
from the detailed version of (7) and the interaction between the graviton and any 
other field from the term —h,,,T"". The only trouble is that you might “drown 
in a sea of indices” if you don’t watch out, as I have already warned you. 

I know of one calculation (in fact one of my favorites in theoretical physics) in 
which we can beat the indices down easily. An interesting question: Einstein said 
that light is deflected by a massive object, but is light deflected gravitationally by 
light? Tolman, Ehrenfest, and Podolsky discovered that in the weak field limit two 
light beams moving in the same direction do not interact gravitationally, but two 
light beams moving in the opposite directions do. Surprising, eh? 

The scattering of two photons k, + ky — p, + pz via the exchange of a graviton 
is given by the Feynman diagram in Figure VIII.1.2, with the momentum transfer 
gq = Pp, — F, plus another diagram with p, and p> interchanged. The Feynman rule 
for coupling a graviton to two photons can be read off from 


8A I. Vainshtein, Phys. Lett. 39B:393, 1977; see also C. Deffayet, G. Dvali, G. 
Gabadadze, and A. I. Vainshtein, Phys. Rev. D65:044026, 2002. 
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Figure VIII.1.2 


A” Thy = he (FF, x Nyy FFP) 


but all we need is that the interaction involve two powers of spacetime derivatives 
d acting on the electromagnetic potential A,, so that the graviton-photon-photon 
vertex involves 2 powers of momenta, one from each photon. Hence the scat- 
tering amplitude (with all Lorentz indices suppressed) has the schematic form 
~ (k,P\) D (kz p2). The n’s in the graviton propagator D tie the indices on (k, p,) and 
(ky Pp) together. (We have suppressed the polarization vectors of the photons, imag- 
ining that they are to be averaged over in the amplitude squared.) Referring to (12), 
we see that the amplitude is the sum of three terms such as ~ (k, - py)(ko - p2)/q’, 
~ (ky + ko)(py + P2)/q?, and ~ (ky + pz)(ko « p))/q?. Since according to Fourier the 
long distance part of the interaction potential is given by the small g behavior of 
the scattering amplitude, we need only evaluate these terms in the limit g > 0. We 
can throw almost everything away! For example, 


ky py ky ky =0, ky pp = hy (ky thy — pp) > ky - ky 


Just imagining contracting all those indices in our heads is good enough: We obtain 
the amplitude ~ (k, - p,)(ko - p2)/q?. 

If k, and k, point in the same direction, k, - kz « k, »k,; =0. Two photons moving 
in the same direction do not interact gravitationally. 

Of course, this result is not of any practical importance since electromagnetic 
effects are far more important, but this is not an engineering text. In Appendix 1 I 
give an alternative derivation of this amusing result. 


Kaluza-Klein compactification 


You have probably read about how excited Einstein was when he heard of the 
proposal of Kaluza and of Klein to extend the dimension of spacetime to 5 and 
thus unify electromagnetism and gravity. The 5th dimension is supposed to be 
compactified into a tiny circle of radius a far smaller than what experimentalists 
can See; in other words, x° is an angular variable with x° =x°>+ 2a. You have 
surely heard that string theory, at least in its original version, is based on the Kaluza- 
Klein idea. Strings live in 10-dimensional spacetime, with 6 of the dimensions 
compactified. 

I can now show you how the Kaluza-Klein mechanism works. Start with the 
action 
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1 5 
S= 16x, fe x./—g95Rs5 (15) 
in 5-dimensional spacetime. The subscript 5 serves to indicate the 5-dimensional 
quantities. We denote the 5-dimensional metric by g,4 with the indices A and B 
running over 0, 1, 2, 3, 5. 

Assume that g4, does not depend on x°. Plug into S, integrate over x°, and 
compute the effective 4-dimensional action. Since R,; and the 4-dimensional scalar 
curvature R both involve two powers of 0 and g,4 contains g,,,, we must have 
(Exercise VIIL1.5) Rs = R+.---. Thus, (15) contains the Einstein-Hilbert action 
with Newton’s gravitational constant G ~ Gs/a. 

What else do we get? We don’t even have to work through the arithmetic. 
We can argue by symmetry. Under the 5-dimensional coordinate transformation 
x4» x/A = x4 4+ eA(x), we have [see (8)] f’4 2 = hap — 948m — Ope 4. Let us 
choose €,, = 0 and €5(x) to be independent of x5: We go around and rotate each 
of the tiny circles attached to every point in our spacetime a tiny bit. Well, we 
have h), =A, and hy; = hss, but hi; =h,,5 — 0,85. But if we give the Lorentz 
4-vector h,,5 and 4-scalar ¢5 new names, call them A,, and A, this just says 
A’, =A, — 4,4, the usual electromagnetic gauge transformation! 


Since we know that the 5-dimensional action (15) is invariant under x4 > 


x/A = x4 4 gA(x), the resulting 4-dimensional action must be invariant under 

Aum Al, = A, — 4,,A and hence must contain the Maxwell action. Note once 

again the power of symmetry considerations. No need to do tedious calculations. 
Electromagnetism comes out of gravity! 


Differential geometry of Riemannian manifolds 


{ hinted earlier at a deep connection between general coordinate transformation 
and gauge transformation. Let us flesh this out by looking at differential geom- 
etry and gravity. For this sketch we will consider locally Euclidean (rather than 
Minkowskian) spaces. 

The differential geometry of Riemannian manifolds can be elegantly summa- 
rized in the language of differential forms, Consider a Riemannian manifold (such 
as a sphere) with the metric g,,,(x). Locally, the manifold is Euclidean by defini- 
tion, which means 


By (X) = €8(%)3q4€0 (X) (16) 


where the matrix e(x) may be thought of as a similarity transformation that diag- 
onalizes g,,,, and scales it to the unit matrix. Thus, for a D-dimensional manifold 
there exist D “world vectors” er (x) obviously dependent on x and labeled by the 
index a = 1, 2,---, D. The functions er (x) are known as “vielbeins” (meaning 
“many legs” in German, vierbeins = four legs for D = 4, dreibeins = three legs 
for D = 3, and so on.) In some sense, the vielbeins can be thought as the “square 
root” of the metric. 
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Let us clarify by a simple example. The familiar two-sphere (of unit radius) has 
the line element? ds? = d6* + sin? 6dy*. From the metric (ggg = 1, 84 = sin 9) 
we can read off ey = | and e = sin 6 (all other components are zero). We are 
invited to define D 1-forms e* = e4 dx". (In our example, e! = d0, e” = sin 6dg.) 

On a curved mamfold, when we parallel transport a vector, the vector changes 
when expressed in terms of the locally Euclidean coordinate frame. (This is just 
the familiar statement that on a curved manifold such as the surface of the earth 
the notion of a vector pointing straight north is a local concept: When we move 
infinitesimally away keeping our “north vector” pointing in the same direction, it 
will end up being infinitesimally rotated away from the “north vector” defined at 
the point we have just moved to.) This infinitesimal rotation of the vielbeins is 
described by 


det = —ae? (17) 


Note that since w generates an infinitesimal rotation it is an antisymmetric matrix: 
wt? = —w*, Since de® is a 2-form, w is a 1-form, known as the connection: It 
“connects” the locally Euclidean frames at nearby points. (Since the indices a, b, 
etc. are associated with the Euclidean metric 5,, we do not have to distinguish 
between upper and lower indices. When we do write upper or lower indices a, 
b, etc. it is for typographical convenience.) In the simple example of the sphere, 
de! =0 and de” =cos 6 d0 dg and so the connection has only one nonvanishing 
component w!* = ~—w?! = — cos 6 dg. 

At any point, we are free to rotate the vielbeins: If you use the vielbeins en Iam 
free to use some other vielbeins en instead, as long as mine are related to yours by 
a rotation er (x) = OF (xe? (x). [You can check that g,,,(x) = er (x54 pec (x) = 
en (x)8,,€7 (x) if OT O = 1.] The connection ’ is defined by de’? = —w'e”. 
You can readily work out that (suppressing indices) 


w= Ow'O! —(d0)o" (18) 


The local curvature of the manifold is a measure of how the connection varies 
from point to point. We would like the curvature to be invariant under the local 
rotation O (or at least to transform as a tensor so that by contracting it with 
vectors we can form a scalar). The desired object is the 2-form R® = do” + + 

wa’. Youcan check that R = OR’ O7 . (For the sphere, R!? = dw!* + ww 
sin 6 d@ dg.) Written out in components, R?? = Rv dx"dx”. T leave it to you to 
verify that Rep Pe*er is the usual Riemann curvature tensor R*, where e% is the 
inverse of the matrix ef. In particular, R’? ele? is the scalar curvature, which in 
our convention works out to be +1 for the sphere. 


* Note that this represents the square of an infinitesimal distance element and not an 
area element, and so a quantity such as d6 is literally the square of d@ and not the wedge 
product d6dé (of Chapter IV.4), which would have been identically zero. 
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Thus, Riemannian geometry can be elegantly summarized by the two statements 
(again suppressing indices) 


de+we=0 (19) 
and 
R=dw+a" (20) 


Look familiar? You should be struck by the similarity between (20) and the 
expression for the field strength in nonabelian gauge theories F = dA + A? . 
Note w transforms [see (18)] exactly the same way as the gauge potential A. 
But one nagging difference, namely the lack of an analog of e in gauge theory, 
has long bothered some theoretical physicists (but is shrugged off by most as 
inconsequential). Also, note that Einstein theory is linear in R while Yang-Mills 
theory is quadratic in F’. 

After this potentially profound comment, I end with a rather technical remark 
about the coupling of gravity to spin 4 fields. First, we of course have to Wick 
rotate so that the vierbein e* erects a locally Minkowskian rather than a Euclidean 
coordinate frame. The indices a, b, etc. are now contracted with the Minkowskian 
metric 7,,- The slight subtlety is that the Dirac gamma matrices y“ are associated 
with the Lorentz rotation of the vierbein er. (x)= OF (xe? (x) and thus carry the 
Lorentz index a rather than the “world” index jz. Hence the flat space Dirac action 
f dtxp(iy"a, — m)y must be generalized to f d4x./—gy (iy Maye, 8” 9 — 
m)y. In contrast to the action for integer spin fields in curved spacetime (see 
Chapter I.10), the Dirac action in curved spacetime involves the vierbein explicitly. 


Appendix 1: Light on light again 


The stress-energy tensor T“” of a light beam moving in the x-direction has four nonzero 
components: the energy density T™ of course, then T* =T™ since photons carry the 
same energy and momentum, next T*° — T° by symmetry, and finally T** = T™ since the 
stress-energy tensor of the electromagnetic field is traceless (Chapter [.10). Without having 
to solve Einstein’s equations in the weak field limit (13) explicitly we know immediately 
that 4° — p% — p*9 = h** =h. The metric around the light beam is given by go) = 
1+h, go. = 8x0 = —A, and g,, = —1+ A (and of course g,, = g,, = —1 plus a bunch of 
vanishing components). Consider a photon moving parallel to the light beam. Its worldline 
is determined by (recall Chapter 1.10) 


Let’s calculate d?y/dt? and d*z/dt? with (dy/dt), (dz/dt) « (dt/dt), (dx/dt). Using 
(3) we find (with j4, v restricted to 0, x) 
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d*y dx" dx” 
de2 = 5 (Oy8 yu t+ Au 8yv — Dy8 uv) —— dt at 
dt dx dt dx 
=—40,0) | Go? + dx - 20 | =-ho,nt - Sy 
Lean) | (S) . Te ae | 7 2G ~ 


For a photon moving in the same direction as the light beam dt = dx and d?y/dt* = 
d?z/d¢? =0. We have once again derived the Tolman-Ehrenfest-Podolsky effect. Note we 
never had to solve for hh. 

Incidentally, if you are a bit unsure of dt = dx, the condition ds = 0 for a light beam 
moving in the x-direction amounts to (1 + A)dt? — 2hdtdx — (1— h)dx? =0. Upon di- 
vision by dt” we obtain —(1-+ h) + 2hv + (1—A)v? =O, with v = dx /dt. The quadratic 
equation has two roots v = F(1 + A)/(1 — A). The negative root gives v = 1, and thus for a 
photon moving in the same direction as the light beam dx /dt = 1. In contrast, the positive 
root v = —(1+ A)/(1— h) describes a photon moving in the opposite direction. 


Appendix 2: The helicity structure of gravity 


To gain a deeper understanding of the difference between Einstein’s and Dr. G’s theories 
let us look at the helicity structure of the interaction in the two cases. To warm up, consider 
the interaction between two conserved currents due to the exchange of a spin 1 particle 
of momentum k and mass m: Uy Jay = K 1) Joy - Jiy Jig: Use current conservation 
k, J =0 to eliminate J° = ki J! /w (with = k°). We obtain (kik! /o? — 3) J/) Joy Let 
k point in the 3rd direction and use k? = w* — m2 to write this as —[(m?/w*) Joy Je (2) + 
Jin Joy + diy Joyh We see that as m — 0 the longitudinal component of the current J? 
indeed decouples as explained in Chapter II.7 and we obtain — 1 Mays oO” + diy “Io. 
showing explicitly that the photon has helicity +1. (Obvious notation: J't’? = 14 i7? 
etc.) 

Onward to gravity. Consider the interaction Thy T2yuv — €TyT(2), where € = 5 for 
Einstein and } for Dr. G. For ease of writing I will now omit the subscripts (1) and (2). 
Conservation k,,T“" = 0 allows us to eliminate T° =k/T/'/w and T® = kik'T i" /w?. 
Again taking k to point in the 3rd direction we obtain the mess 


4 2 
(“) 73733 42 (“) (7373 4 72372) 4 Pipl 4 722722 4 app? 
@ a 


(21) 
-#|( ) e477? | (2) 734 pul = 
m 
which simplifies in the limit m > 0 to 
pill + [22722 + ap i212 _ e(r'} + T?y(7T" + T72) (22) 


In Einstein’s theory, € = 5 and this becomes 


(rT! _ T”?)(T!! _ T”’) + ap l2pl2 (23) 
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which lo and behold is equal to 4(71t? 2p. 2b phi l+i2, 142), showing 
that indeed the graviton carries helicity +2. 


VII.1.1. 


VII. 1.2. 


VII.1.3. 


VIII.1.4. 


VIiI.1.5. 


VIII.1.6. 


VIIL1.7. 


Exercises 


Work out T“” for a scalar field. Draw the Feynman diagram for the contribution 
of one-graviton exchange to the scattering of two scalar mesons. Calculate the 
amplitude and extract the interaction energy between two mesons sitting at rest, 
thus deriving Newton’s law of gravity. 


Work out T“” for the Yang-Mills field. 


Show thatif h ,,,, does not satisfy the harmonic gauge, we can always make a gauge 
transformation with ¢, determined by 37, = 4,,h4 — 18,h} so that it does. All 
of this should be conceptually familiar from your study of electromagnetism. 


Count the number of degrees of polarization of a graviton. [Hint: Consider a plane 
wave h(x) = Ay, (kyei** just because it is a bit easier to work in momentum 
space. A symmetric tensor has 10 components and the harmonic gauge k,,h% = 
1k,h* imposes 4 conditions. Oops, we are left with 6 degrees of freedom. What 
is going on?] (Hint: You can make further gauge transformation and still stay in 
the harmonic gauge. The graviton should have only 2 degrees of polarization.] 


The Kaluza-Klein result that we argued by symmetry considerations can of course 
be derived explicitly. Let me sketch the calculation for you. Consider the metric 


ds? = By, dx"dx® — a’[do + A, (x)dx"? 


where @ denotes an angular variable 0 < 6 < 27. With A, =0, this is just 
the metric of a curved spacetime, which has a circle of radius a attached at 
every point. The transformation 6 — @ + A(x) leaves ds invariant provided 
that we also transform A,,(x) > A,(x) — 4,,A(x). Calculate the 3-dimensional 
scalar curvature Rs and show that Rs = Ry — 1a?F uvF”. Except for the precise 
coefficient } this result follows entirely from symmetry considerations and from 
the fact that R; involves two derivatives on the 5-dimensional metric, as explained 
in the text. After some suitable rescaling this is the usual action for gravity plus 
electromagnetism. Note that the 5-dimensional metric has the explicit form 


2 2 
5 Buy ~a°A,A, —a°A 
ie=( " _a2A " ss (24) 
Generalize the Kaluza-Klein construction by replacing the circles by higher 
dimensional spheres. Show that Yang-Mills fields emerge. 


The vielbeins for a spacetime with Minkowski metric is defined by g,,,(x) = 
e4 (x)nape?(x), Where the Minkowski metric 7,, replaces the Euclidean metric 
5,p- The indices a and b are to be contracted with 7,,. For example, R®? = 
dw®? + wn. gw”. Show that everything goes through as expected. 


Chapter VIII.2 


The Cosmological Constant Problem and 
the Cosmic Coincidence Problem 


The force that knows too much 


The word paradox has been debased by loose usage in the physics literature. A 
real paradox should involve a major and clear-cut discrepancy between theoret- 
ical expectation and experimental measurement. The ultraviolet catastrophe, for 
example, is a paradox, the resolution of which around the dawn of the twentieth 
century ushered in quantum physics. I now come to the most egregious paradox 
of present day physics. 

The electromagnetic force knows about the particles carrying charge, and the 
strong force knows about the particles carrying color. And the gravitational force? 
It knows everybody! More precisely, anybody carrying energy and momentum. 

Within a particle physics frame of mind, which is the only frame of mind 
we have in exploring the fundamental structure of physics, the graviton can be 
regarded as just another particle. Indeed, given that a massless spin 2 particle 
couples to the stress-energy tensor, one can reconstruct Einstein’s theory. 

Nevertheless, there is an uncomfortable feel to this whole picture. Gravity has 
to do with the curvature of spacetime, the arena in which all fields and particles 
live. The graviton is not just another particle. 

This in essence is the root origin of the paradox of the cosmological constant. 
The graviton is not just another particle—it knows too much! 


The cosmological constant 


In the absence of gravity, the addition of a constant A to the Lagrangian £ > 
£& — A has no effect whatsoever. In classical physics the Euler-Lagrange equations 
of motion depend only on the variation of the Lagrangian. In quantum field theory 
we have to evaluate the functional integral Z = [ Dge' f xh which upon the 
inclusion of A merely acquires a multiplicative factor. As we have seen repeatedly, 
a multiplicative factor in Z does not enter into the calculation of Green’s function 
and scattering amplitudes. 

Gravity, however, knows about A. Physically, the inclusion of A corresponds to 
a shift in the Hamiltonian H — H + f d>x A. Thus, the “cosmological constant” 
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A describes a constant energy or mass per unit volume permeating the universe, 
and of course gravity knows about it. 

More technically, the term in the action — f d*x A is not invariant under a 
coordinate transformation x — x’(x). In the presence of gravity, general coor- 
dinate invariance requires that the term — f{ d*x A in the action S be modified to 
—f d+x /eA, as I explained way back in Chapter 1.10. Thus, the gravitational 
field g,,, knows about A, the infamous cosmological constant introduced by Ein- 
stein and lamented by him as his biggest mistake. This often quoted lament is itself 
a mistake. The introduction of the cosmological constant is not a mistake: It should 
be there. 


Symmetry breaking generates vacuum energy 


In our discussion on spontaneous symmetry breaking, we repeatedly ignored an 
additive term 24/4, that appears in £. 

Particle physics is built on a series of spontaneous symmetry breaking. As 
the universe cools, grand unified symmetry is spontaneously broken, followed by 
electroweak symmetry breaking, then chiral symmetry breaking, Just to mention 
a few that we have discussed. At every stage a term like :+/4A appears in the 
Lagrangian, and gravity duly takes note. 

How large do we expect the cosmological constant A to be? As we will see, for 
our purposes the roughest order of magnitude estimate suffices. Let us take A to be 
of order 1. As for 2, for the three kinds of symmetry breaking I just mentioned, 
p is of order 10!7, 10”, and 1 Gev, respectively. We thus expect the cosmological 
constant A to be roughly yz4 = 2/(~')3, where the last form of writing 4 reminds 
us that A is a mass or energy density: An energy of order 4: packed into a cube 
of size «~!. But this is outrageous even if we take the smallest value for 4: We 
know that the universe is not permeated with a mass density of the order of 1 Gev 
on every cube of size 1 (Gev)—!. 

We don’t have to put in actual numbers to see that there is a humongous 
discrepancy between theoretical expectation and observational reality. If you 
want numbers, the current observational bound on the cosmological constant is 
<(10~ev)*. With the grand unification energy scale, we are off by (17+9+3)x 
4 = 116 orders of magnitude. This is the mother of all discrepancies! 

With the Planck mass Mp, ~ 10!°Gev the natural scale of gravity, we would 
expect A ~ Mi if it is of gravitational origin. We are then off by 124 orders 
of magnitude. We are not talking about the crummy caiculation of some pitiful 
theorist not fitting some experimental curve by a factor of 2. 

We can imagine the universe starting out with a negative cosmological constant, 
fined tuned to cancel the cosmological constant generated by the various episodes 
of spontaneous symmetry breaking. Or there must be a dynamical mechanism that 
adjusts the cosmological constant to zero. 

Notice I say zero, because the cosmological constant problem is basically an 
enormous mismatch between the units natural to particle physics and natural to 
cosmology. Measured in units of Gev* the cosmological constant is so incredibly 
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tiny that particle physicists have traditionally assumed that it must be zero and 
have looked in vain for a plausible mechanism to drive it to zero. One of the 
disappointments of string theory is its inability to resolve the cosmological constant 
problem. As of the writing of this chapter around the turn of the millennium, the 
brane world scenarios (Chapter I.6) have generated a great deal of excitement by 
offering a glimmer of a hope. Roughly, the idea is that the gravitational dynamics 
of the larger space that our universe is embedded in may cancel the effect of the 
cosmological constant. 


Cosmic coincidence 


But Nature has a big surprise for us. While theorists racked their brains trying 
to come up with a convincing argument that A = 0, observational cosmologists 
steadily refined their measurements and recently changed their upper bound to an 
approximate equality 


A ~ (10-7ev) 4! (1) 


The cosmological constant paradox deepens. Theoretically, it is easier to explain 
why some quantity is mathematically 0 than why it happens to be ~ 10~!4 in the 
units natural (?) to the problem. 

To make things worse, (10~7ev)* happens to be the same order of magnitude 
as the present matter density of the universe oj. This is sometimes referred to as 
the cosmic coincidence problem. 

Now the cosmological constant A is, within our present understanding, a 
parameter in the Lagrangian. On the other hand, since most of the mass density 
of the universe resides in the rest mass of baryons, as the universe expands jy (f) 
decreases as [1/R(t)P, where R(t) denotes the scale size of the universe.! In the 
far past, 0,;, was much larger than A, and in the far future, it will be much smaller. 
It just so happens that, in this particular epoch of the universe, when you and I are 
around, py ~ A. Or to be less anthropocentric, the epoch when py, ~ A happens 
to be when galaxy formation has been largely completed. 

Very bizarre! 

In their desperation, some theorists have even been driven to invoke anthropic 
selection.” 


Ror an easy introduction to cosmology, see A. Zee, Unity of Forces in the Universe, 
vol. II, chap. 10. 


? For a recent review, see A. Vilenkin, hep-th/0106083. 


Chapter VIII.3 


Effective Field Theory Approach to 
Understanding Nature 


Low energy manifestation 


The pioneers of quantum field theory, Dirac for example, tended to regard field the- 
ory as a fundamental description of Nature, complete in itself. As [have mentioned 
several times, in the 1950s, after the success of quantum electrodynamics many 
leading particle physicists rejected quantum field theory as incapable of dealing 
with the strong and weak interactions, not to mention gravity. Then came the great 
triumph of field theory in the early 1970s. But after particle physicists retrieved 
field theory from the dust bin of theoretical physics, they realized that the field 
theories they were studying might be “merely” the low energy manifestation of a 
deeper structure, a structure first identified as a grand unified theory and later as a 
string theory. Thus was developed an outlook known as the effective field theory 
approach, pace Dirac. 

The general idea is that we can use field theory to say something about phys- 
ics at low energies or equivalently long distances even if we don’t know anything 
about the ultimate theory, be it a theory built on strings or some as yet undreamed 
of structure. An important consequence of this paradigm shift was that nonrenor- 
malizable field theories became acceptable. I will illuminate these remarks with 
specific examples. 

The emergence of this effective field theory philosophy, championed especially 
by Wilson, marks another example of cross fertilization between condensed mat- 
ter and particle physics. Toward the late 1960s, Wilson and others developed a 
powerful effective field theory approach to understanding critical phenomena, cul- 
minating in his Nobel Prize. The situation in condensed matter physics is in many 
ways the opposite of that in particle physics at least as particle physics was under- 
stood in the 1960s. Condensed matter physicists know the short distance physics, 
namely the quantum mechanics of electrons and ions. But it certainly doesn’t help 
in most cases to write down the Schrédinger equation for the electrons and ions. 
Rather, what one would like to have is an effective description of how a system 
would respond when probed at low frequency and small wave vector. A striking 
example is the effective theory of the quantum Halli fluid as described in Chap- 
ter V1.2: The relevant degree of freedom is a gauge field, certainly a far cry from 
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the underlying electron. As in the o model description (Chapter VI.4) of quan- 
tum chromodynamics, it is fair to say that without experimental guidance theorists 
would have a terribly hard time deciding what the relevant low energy long dis- 
tance degrees of freedom might be. You have seen numerous other examples in 
condensed matter physics, from the Landau-Ginzburg theory of superconductivity 
to Peierls instability. 


The threshold of ignorance 


In our discussion of renormalization, I espouse the philosophy that a quantum 
field theory provides an effective description of physics up to a certain energy 
scale A, a threshold of ignorance beyond which physics not included in the theory 
comes into play. In a nonrenormalizable theory, various physical quantities that 
we might wish to calculate will come out dependent on A, thus indicating that 
the physics at or beyond the scale A is essential for understanding the low energy 
physics we are interested in. Nonrenormalizable theories suffer from not being 
totally predictive, but nevertheless they may be useful. After all, the Fermi theory 
of the weak interaction described experiments and even foretold its own demise. 

In arenormalizable theory, various physical quantities come out independent of 
A, provided that the calculated results are expressed in terms of physical coupling 
constants and masses, rather than in terms of some not particularly meaningful 
bare coupling constants and masses. Low energy physics is not sensitive to what 
happens at high energies, and we are able to parametrize our ignorance of high 
energy physics in terms of a few physical constants. 

From the late 1960s to the 1970s, one main thrust of fundamental physics was to 
classify and study renormalizable theories. As we know, this program was “more 
than spectacularly successful.” It allowed us to pin down the theory of the strong, 
the weak, and the electromagnetic interactions. 


Renormalization group flow and dimensional analysis 


The effective field theory philosophy is intrinsically tied to renormalization group 
flow. In a given field theory, as we flow toward low energies, some couplings may 
tend to zero while others do not (and if they tend to infinity as in QCD, then we 
are unable to figure out the effective theory without experimental input). Thus, the 
first step is to calculate the renormalization group flow. A simple example is given 
in Exercise VHE3.1. 

In many cases, we Can simply use dimensional analysis. As I explained in our 
earlier discussion on renormalization theory, couplings with negative dimensions 
of mass are not important at low energies. To be specific, suppose we add a gg® 
term to a Ag’ theory. The coupling g has the dimension of inverse mass squared, 
Let us define M* = 1/g. At low energies, the effect of the gg® term is suppressed 
by (E/M)*. 


Vil.3. Effective Field Theory Approach to Understanding Nature 439 


How do we understand Schwinger’s spectacular calculation of the anomalous 
magnetic moment of the electron in the effective field theory philosophy? 

Let me first tell the traditional (i.e., pre-Wilsonian) version of the story. A 
student could have asked, “Professor Schwinger, why didn’t you include the term 
(1/M) vo" F,,, in the Lagrangian?” 

The answer is that we better not. Otherwise, we would lose our prediction for 
the anomalous magnetic moment; it would depend on M. Recall that [yr] = 3 
and [A,,]= 1, and hence yo" F,,, has mass dimension } +} +1+1=5> 4. 
The requirement of renormalizability, that the Lagrangian be restricted to contain 
operators of dimension 4 or less, provides the rationale for excluding this term. 

Actually, the “real” punchline of my story is that Schwinger probably would not 
have answered the question. When I took Schwinger’s field theory class, it was well 
known among the students that it was forbidden to ask questions. Schwinger would 
simply ignore any raised hands. There was no opportunity to ask questions after 
class either: As he uttered his last sentence of an invariably beautifully prepared 
lecture, he would sail majestically out of the room. Dirac dealt with questions 
differently. I was too young to have witnessed it, but the story goes that when a 
student asked, “Professor Dirac, I did not understand . . . ,” Dirac replied “That is 
an assertion, not a question.” 

The modern retelling of the magnetic moment story turns it around. We now re- 
gard the Lagrangian of quantum electrodynamics as an effective Lagrangian which 
should include an infinite sequence of terms of ever higher dimensions, with coeffi- 
cients parametrizing our threshold of ignorance. Yes, the term (1/M) YorwF av is 
there, with some unknown M having the dimension of a mass. Schwinger’s result, 
that quantum fluctuations generate a term (a/27)(1/2m,) pow F wv? Should then 
be interpreted as saying that the anomalous magnetic moment of the electron is 
predicted to be [(a@/27r) (1/2m,) + 1/M]. The close agreement of (a'/27)(1/2m,) 
with the experimental value of the anomalous magnetic moment can then be turned 
around to set a lower bound on M >> (427/a)m,. 

Equivalently, Schwinger’s result predicts the anomalous magnetic moment 
of the electron if we have independent evidence that M is much larger than 
[(@/22)(1/2m,)] |. | want to emphasize that all of this makes total physical sense. 
For example, if you speculate that the electron has some finite size a, then you 
would expect M ~ 1/a. The anomalous magnetic moment calculation gives an 
upper bound for a, telling us that the electron must be pointlike down to some 
small scale. Alternatively, we could have had independent evidence, from electron 
scattering for example, that a has to be smaller than a certain length, thus giving 
us a lower bound on M. 

To underscore this point, imagine that in 1948 we followed Schwinger and 
quickly calculated the anomalous magnetic moment of the proton. We could 
literally have done it in 3 seconds, since all we have to do is replace m, by m, 
in the Lagrangian, thus obtaining (a/27)(1/2m,)~o""'WF,,,, which would of 
course disagree resoundingly with experiment. The disagreement tells us that we 
had not included all the relevant physics, namely that the proton interacts strongly 
and is not pointlike. Indeed, we now know that the anomalous magnetic moment 
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of the proton gets contributions from the anomalous magnetic moments and the 
orbital motion of the quarks inside the proton. 


Effective theory of proton decay 


It may seem that with the effective field theory approach we lose some predictive 
power. But effective field theories can also be surprisingly predictive. Let me give 
a specific example. Suppose we had never heard of grand unified theory. Ail we 
know is the SU (3) @ SU(2) @ U(1) theory. An experimentalist tells us that he is 
planning to see if the proton would decay. 

Without the foggiest notion about what would cause the proton to decay we can 
still write down a field theory to describe proton decay. The Lagrangian £ is to be 
constructed out of quark g and lepton / fields and must satisfy the symmetries that 
we know. Three quarks disappear, so we write down schematically gqq, but three 
spinors do not a Lorentz scalar make. We have to include a lepton field and write 
qqqi. 

Since four fermion fields are involved, the terms ggq/ have mass dimension 6 
and so in £ they have to appear as (1/M*)qqql with some mass M, corresponding 
to the mass scale of the physics responsible for proton decay. The experimental 
lower bound on the lifetime of the proton sets a lower bound on M. 

Imagine doing an effective field theory analysis of proton decay before the idea 
of quarks was conceived. We would construct an effective Lagrangian out of the 
available fields, namely the proton field p, the electron field e, and the pion field z, 
and thus write down something like gx ~ ep with some constant g. Note that 7 ep 
has mass dimension 4 and hence g is dimensionless. Since 7~ép violates isospin 
invariance, we would expect g to be of the same order as some measure of isospin 
breaking, roughly the electromagnetic coupling strength a, but this would give an 
unacceptably short lifetime to the proton. We would be forced to set g equal to a 
ridiculously small number, which would seem highly unnatural. Thus, at least in 
hindsight, we can say that the extremely long lifetime of the proton almost points 
to the existence of quarks. (Can the cosmological constant puzzle be solved in the 
same way?) 

Another way of saying this is that SU(3) ® SU(2) ® U(1) plus renormaliz- 
ability predicts one of the most striking facts of the universe, the stability of the 
proton. In contrast, the old pion-nucleon theory glaringly failed to explain this 
experimental fact. 

In accordance with our philosophy, & must be invariant under SU (3) @ 
SU (2) @ U(1), under which quark and lepton fields transform rather idiosyn- 
cratically, as we saw in Chapter VII.5. To construct & we have to sit down and list 
all Lorentz invariant SU (3) ® SU(2) @ U(1) terms of the form gqql. 

Sitting down, we would find that, assuming only one family of quarks and 
leptons for simplicity, there are only four terms we can write down for pro- 
ton decay, which I list here for the sake of completeness: (i,C qi) (urCdp), 
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(epCup)(GC4z), UiC41)(GLC4z). and (egCug)(ugCdg). Herel; =(%), and 
qi. = (4%), denote the lepton and quark doublet of SU(2) @ U(1), the twiddle 
is defined by 7/ = 1,¢'/ with SU(2) indices i, j = 1, 2 (see Appendix B), and C 
denotes the charge conjugation matrix. Color indices on the quark fields are con- 
tracted in the only possible way. The effective Lagrangian is then given by the sum 
of these four terms, with four unknown coefficients. 

The effective field theory tells us that all possible baryon number violating 
decay processes can be determined in terms of four unknowns. We expect that 
these predictions will hold to an accuracy of order (My/M )?. (If My were zero, 
SU (3) ® SU(2) @ U(1) would be exact.) 

Of course, we can increase our predictive power by making further assumptions. 
For example, if we think that proton decay is mediated by a vector particle, as in 
a generic grand unified theory, then only the first two terms in the above list are 
allowed. In a specific grand unified theory, such as the SU (5) theory, the two 
unknown coefficients are determined in terms of the grand unified coupling and 
the mass of the X boson. 

To appreciate the predictive power of the effective field theory approach, inspect 
the list of the four possible operators. We can immediately predict that while 
proton decay violates both baryon number B and lepton number L, it conserves 
the combination B — L. I emphasize that this is not at all obvious before doing 
the analysis. Could you have told the experimentalist which of the two possible 
modes n —> e*+2~ orn — e_x* he should expect? A priori, it could well be that 
B + L is conserved. 

Note that Fermi’s theory of the weak interaction would be called an effective 
field theory these days. Of course, in contrast to proton decay, beta decay was 
actually seen, and the prediction from this sort of symmetry analysis, namely the 
existence of the neutrino, was triumphantly confirmed. 

Along the same line, we could construct “an effective field theory of neu- 
trino masses. Surely one of the most exciting experimental discoveries in particle 
physics of recent years was that neutrinos are not massless. Let us construct an 
SU (2) ® U(A) invariant effective theory. Since v; resides inside [;, without do- 
ing any detailed analysis we can see that a dimension-5 operator is required: 
schematically /, /;, contains the desired neutrino bilinear but it carries hypercharge 
Y /2 = —1; on the other hand, the Higgs doublet g carries hypercharge +}, and so 
the lowest dimensional operator we can form is of the form [/gg with dimension 
3+3+1+1=5. Thus, the effective £ must contain a term (1/M)llpy, with 
M the mass scale of the new physics responsible for the neutrino mass. Thus, by 
dimensional analysis we can estimate m,, ~ m; /M, with m; some typical charged 
lepton mass. If we take m, to be the muon mass ~ 10*Mev and m, ~ 10 'ev, we 
find M ~ (102Mev)?/10~!(10~°Mev) = 10°Gev. 

The philosophy of effective field theories valid up to a certain energy scale A 
seems so obvious by now that it is almost difficult to imagine that at one time 
many emninent physicists demanded much more of quantum field theory: that it be 
fundamental up to arbitrarily high energy scales. 
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VIIL.3.4. 


VI. Gravity and Beyond 


Exercises 


Consider 
£ = 4] @y,)? + o2)"| - Aw! + o) - avie3 (1) 


We have taken the O(2) theory from Chapter 1.9 and broken the symmetry 
explicitly. Work out the renormalization group flow in the (A — g) plane and draw 
your own conclusions. 


Assuming the nonexistence of the right handed neutrino field vz (i.e., assuming 
the minimal particle content of the standard model) write down all SU (2) @ U (1) 
invariant terms that violate lepton number L by 2 and hence construct an effective 
field theory of the neutrino mass. Of course, by constructing a specific theory one 
can be much more predictive, Out of the product /;/, we can form a Lorentz scalar 
transforming as either a singlet or triplet under SU (2). Take the singlet case and 
construct a theory. [Hint: For help, see A. Zee, Phys. Lett. 93B: p. 389, 1980.] 


Let A, B, C, D denote four spin 4 fields and label their handedness by a subscript: 
yA; =hA, with h = +1. Thus, A, is right handed, A_ left handed, and so on. 
Show that 


(A,B,)(C_,D_») = —3(Any” D_p)(C_n¥ Bn) (2) 


This is an example of a broad class of identities known as Fierz identities (some 
of which we will need in discussing supersymmetry.) Argue that if proton decay 
proceeds in lowest order from the exchange of a vector particle then only the 
terms (,Cqz)(urCdp) and (eépCup)(q,Cq_) are allowed in the Lagrangian. 


Given the conclusion of the previous exercise show that the decay rate for the 
processes p> a+ +i, pom +et,n>27°+i3, andn—oa™ +e* are 
proportional to each other, with the proportionality factors determined by a 
single unknown constant [the ratio of the coefficients of (A Cq,)(urCdp) and 
(erCur)(GiCGr)} 

For help on these last three exercises see 8. Weinberg, Phys. Rev. Lett. 43: 311, 
1979; F. Wilczek and A. Zee, ibid. p. 1571; H. A. Weldon and A. Zee, Nucl. Phys. 
B173: 269, 1980. 


Chapter VIII.4 


Supersymmetry: 
A Very Brief Introduction 


Unifying bosons and fermions 


Let me start with a few of the motivations for supersymmetry. (1) All experimen- 
tally known symmetries relate bosons to bosons and fermions to fermions. We 
would like to have a symmetry, supersymmetry, relating bosons and fermions. 
(2) It is natural for fermions to be massless (recall Chapter VII.6), but not for 
bosons. Perhaps by pairing the Higgs field with a fermion field we can resolve the 
hierarchy problem mentioned in Chapter VII.6. (3) Recalling from Chapter I.5 
that fermions contribute negatively to the vacuum energy, you might be tempted 
to speculate that the cosmological constant problem could be solved if we could 
get the fermion contribution to cancel the boson contribution. 

Disappointingly, it has been more than 30 years! since the conception of su- 
persymmetry (Golfand and Likhtman constructed the first supersymmetric field 
theory in 1971) and direct experimental evidence is still lacking. All existing su- 
persymmetric theories pair known bosons with unknown fermions and known 
fermions with unknown bosons. Supersymmetry has to be broken at some mass 
scale M beyond the regime already explored experimentally, but then (as explained 
in Chapter VHI.2) we might expect a cosmological constant of order M 4 

Be that as it may, supersymmetric field theories have many nice properties 
(hardly surprising since the relevant symmetry is much larger). Supersymmetry 
has thus attracted a multitude of devotees. I give you here as brief an introduction 
to supersymmetry as I can write. In the spirit of a first exposure, I will avoid 
mentioning any subtleties and caveats, hoping that this brief introduction will be 
helpful to students before they tackle the tomes out there. 


'For a fascinating account of the early history of supersymmetry, see G. Kane and 
M. Shifman, eds., The Supersymmetric World: The Beginning of the Theory. 
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Inventing supersymmetry 


Suppose one day you wake up wanting to invent a field theory with a symmetry 
relating bosons to fermions. The first thing you would need is the same number 
of fermionic and bosonic degrees of freedom. The simplest fermion field is the 
two-component Weyl spinor yw. You would now have one complex degree of 
freedom,” so you would have to throw in a complex scalar field gy. You could 
proceed by trial and error: Write down a Lagrangian including all terms with 
dimension up to four and then adjust the various parameters in the Lagrangian 
until the desired symmetry appears. For instance, you might adjust jz in the mass 
terms p2o'g + m(Wy + vy) until the theory becomes more symmetrical so that 
the boson and the fermion have the same mass. 

If you were to try to play the game by using a Dirac spinor V and a complex 
scalar g you would be doomed to failure from the very start since there would 
be twice as many fermionic degrees of freedom as bosonic degrees of freedom. I 
believe that the development of supersymmetry was very much retarded by the fact 
that until the early 1970s most field theorists, having grown up with Dirac spinors, 
had little knowledge of Weyl spinors. That was a hint that now is the time for you 
to get thoroughly familiar with the dotted and undotted notation of Appendix E. 
To read this chapter, you need to be fluent with that notation. 


Supersymmetric algebra 


It is perfectly feasible to construct this supersymmetric field theory, known as the 
Wess-Zumino model, by trial and error, but instead I will show you an elegant 
but more abstract approach known as the superspace and superfield formalism, 
invented by Salam and Strathdee. We will have to develop a considerable amount 
of formal machinery. Everything is very super here. 

Write the supersymmetry generator taking us from g to w, as Q, (known 
as the supercharge). The statement that Q, transforms as a Weyl spinor means 
[J#’, Oo] = —i(6"”)_? Og, where J”” denotes the generators of the Lorentz 
group. Of course, since Q, is independent of the spacetime coordinates 
[P*, Q.]=0. From Appendix E we denote the conjugate of Q, by QO, and 
[ye", OM} = 16 ")* 5 OF. 

We have to write down the anticommutation relation between the Grassman 
objects Q, and Q4 and now the work we did in Appendix E really pays off. The 
supersymmetry algebra is given by 


{Qu, Of} = 2(0"),.5P,, (1) 


? One complex degree of freedom on mass shell and two complex degrees of freedom 
off mass shell. See the superfield formalism below. 
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We argue by the ‘“‘what else can it be?” method. The right-hand side must carry the 
indices a and 6 and we know that the only object that carries these indices is 0. 
The Lorentz index jz has to be contracted and the only vector around is P,,. The 
factor of 2 fixes the normalization of Q. 

By the same kind of argument we must have {Q,, Qf} = e\(o""),,F Jv + 
e788, Commuting with P* we see that the constant c, must vanish. Recalling 
that O, = yp, we have {Qy, Q,} =c2&,,; but since the left- hand “ is 
symmetric in @ and y we have c> = 0. Thus, {Q,, Og} = 0 and {O., Q3}= = 
(see Exercise VIII.4.2.) 


A basic theorem 


An important physical fact follows immediately from (1). Contracting with (a vyPa 
we obtain 


P” = (6")"{Q,, Os} (2) 
In particular the time component tells us about the Hamiltonian 


4H = {Qq, Oat = YOu, Oi} =) (QaGi + QiO.) 3) 


We obtain the important theorem that in a supersymmetric field theory any physical 
state |S > must have nonnegative energy: 


=< sQylS >? 0 (4) 
2 ar 


Superspace 


Now that we have constructed the supersymmetric algebra let us keep in mind our 
goal of constructing supersymmetric field theories. To do that, we need to figure 
out and classify how fields transform under this supersymmetric algebra. We have 
to go through a lot of formalism, the necessity for which will become clear in due 
course. 

Imagine that you are trying to invent the superspace jomensm. Let us motivate 
it by staring at the basic relation (1) {Q,, Q ge = 2(o"),3P,. A supersymmetric 
transformation Q followed by its conjugate Q; g generates a translation P,,. Hmm, 
let’s see, P,, = i(0/0x") generates translation in x“, so perhaps Q,, being Grass- 
mannian, would _generate translation in some abstract Grassmannian coordinate 
6%? (Similarly, Q F would generate translation in 6P ) 

Salam and Strathdee invented the notion of a superspace with bosonic and 
fermionic coordinates {x", 0%, ge } with the supersymmetry algebra represented 
by translations in this space. 
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So let us try Q, and QO g being something like 0/d6% and 0/ a6F, respectively. 
But then {Q,, Q g} = 0 and we don’t get (1). We have to keep playing around 
modifying Q, and Q 4. You may already see what we need. If we add a term such 
as a"9,, to Oy, then the 0/90° in Q, acting on 604, will produce something 
like the right-hand side of (1). Similarly, we will want to add a term such as 60“0,, 
to Q,. (Once again, the dotted and undotted notation we worked hard to develop 


fixes what we must write, namely (o),,,0%@,, so that the indices match and obey 
the “southwest to northeast” rule.) Thus, we represent the supercharges as 


a . 6 
0, = 398 7 i(o") 40°, (5) 
and 
y.— <2 OB (gt 6 


You see that (1) is now satisfied. Interestingly, when we translate in the fermionic 
direction we have to translate a bit in the bosonic direction as well. 


Superfield 


A superfield B(x", 6%, 6®), as the name suggests, is just a field living in super- 
space. An infinitesimal supersymmetry transformation takes 


b> G' = (14 1&"O, + iF, OO (7) 


with € and — two Grassmannian parameters. 

It turns out that we can impose some condition on ® and restrict this rather 
broad definition a bit. After staring at (5) and (6) for a while, you may realize that 
there are two other objects, 


0 
Dy = 398 + 10 40°, 


and 
D,= é ope ti (o") 9; 


that we can define, sort of the combinations orthogonal to Q, and Q g- Clearly, 
D,, and Dz anticommute with Q, and Q;. The significance of this fact is that if 
we impose the condition D,® = 0 on the superfield ®, then its transform ®’ also 
satisfies the condition. 
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A superfield ® satisfying the condition D 4 ® = Ois known as achiral superfield. 


The condition is actually easy to implement: Observe that if we define y“ = 
(x* + 16% (o#), 0%) (note we are adding two bosonic quantities here), then 


- a . ; ; 


Thus, a superfield ®(y, 6) that depends on y and 6 only is a chiral superfield. 

Let us expand ® in powers of 6 holding y fixed. Remember that 6 contains two 
components (6!, 67). Thus, we can form an object with at most two powers of 0, 
namely 66, which you worked out in Exercise E.3. Thus, as usual, power series in 
Grassmannian variables terminate, and we have 


P(y, 0) = oy) + V20¢(y) + 00F(y) 


with g(y), &(y), and F(y) merely coefficients in the series at this stage. We can 
Taylor expand once more around x: 


(y, 0) = v(x) + V20W(x) + 06 F(x) 6 
_ oo _ ) 
+ i00"88,p(x) — 4004%000"68,,0,0(x) + V20100"68,, h(x) 


We see that a chiral superfield contains a Weyl fermion field y% , and two complex 
scalar fields gy and F. 


Finding a total divergence 


Let’s do a bit of dimensional analysis for fun and profit. Given that P,, has the 
dimension of mass, which we write as [P,,] = 1 using the same notation as in 
Chapter III.2, then (1), (5), and (6) tell us that [0] =[Q]= ; and (@] = [e]= —3. 
Given [gy] = 1, then (8) tells us [y]= 3, which we know already, and [F]= 2, 
which we didn’t know. In fact, we have never met a Lorentz scalar field with mass 
dimension 2. How can we have a kinetic energy term for F in & with dimension 
4? We can’t. The term F' F already has dimension 4, and any derivative is going to 
make the dimension even higher. Also, didn’t we say that with g and w we balance 
the same number of bosonic and fermionic degrees of freedom? 

The field F(x) definitely has something strange about him. What is he doing in 
our theory? 

Under an infinitesimal supersymmetric transformation the superfield changes 
by 6@ =i(€Q + £Q)®. Referring to (8), (5), and (6), you can work out how the 
component fields y, y, and F transform (see Exercise 5). But we can go a long 


3 This is analogous to the problem of constructing a function f(x, y) satisfying the 
1 
condition Lf = 0 with L = [x(@/ay) — y(@/x)]. We define r = (x? + y*)2 and observe 
that Lr =0. Then any f that only depends on r satisfies the desired condition. 
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way invoking symmetry and dimensional analysis. For example, 5F is linear in 
— or €, which by dimensional analysis must multiply something with dimension 
[3] since [Ff] = 2 and [€]= [E]= —}. The only thing around with dimension [3] 
is 0,4, which carries an undotted index. Note it can’t be aw since ® does not 
contain ¢. By Lorentz invariance we have to find something carrying the index 1, 
and that can only be (0) 4. The dotted index on (o“),,, can only be contracted 
with £. So everything is fixed except for an overall constant: 


BF ~ 8,0" (0 aa k* (9) 


Arguing along the same lines you can easily show that dg ~ &y andédyw ~&F + 
0,goME. 

The important point here is not the overall constant in (9) but that 6 F is a total 
divergence. 

Given any superfield ® let us denote by [®]; the coefficient of @@ in an 
expansion of ® [as in (8)]. What we have learned is that under a supersymmetric 
transformation 6([®],-) is a total divergence and thus f d*x[®] rf 1s invariant under 
supersymmetry. 

Our next observation is that if D g® = 0, then D; ©? — 0 also. In other words, 
if ® is a chiral superfield, then so is (and by extension, >, &*, and so forth.) 


Supersymmetric action 


What do we want to achieve any way? We want to construct an action invariant 
under supersymmetry. 

Finally, after all this formalism we are ready. In fact, it is almost staring us in the 
face: [ d*x[im&? +1? + .--], is invariant under supersymmetry by virtue of 
the last two paragraphs. Squaring (8) and extracting the coefficient of 9 we see by 
inspection that [©2] - = (2F@ — wv). Similarly, [67] - = 3(F oe? ~ gw). Now 
do Exercise VIII.4.6. 

Looks like we have generated a mass term for the Weyl fermion y and its 
coupling to the scalar field gy, but where are the kinetic energy terms, such as 


VF")? Uy? 


Vector superfield 


The kinetic energy terms contain ¥,, which does not appear in ©. To get the 
conjugate field wz, we obviously have to use ®*, and so we are led to consider 
©‘. More formalism here! We call a superfield V(x, 0, 6) a vector superfield if 
V = V'. For example, ©*¢ is a vector superfield. 

Imagine expanding ®'® = y'g + - - - or any vector superfield V in powers of 0 
and 6. The highest power is uniquely 0606 since by the properties of Grassmannian 
variables the only object we can form is 0'070,0,. Any object quadratic in @ and 
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quadratic in 8, such as (@a“6) (00,6), can be beaten down to 6060 by using the 
kind of identities you discovered in the exercises in Appendix E. Let [V]p denote 
the coefficient of 6666 in the expansion of V. 

Again, dimensional analysis can carry us a long way. If V has mass dimension 
n, then [V], has mass dimension n + 2 since @ and 6 each has mass dimension — 7 
Let us study how [V ]p changes under an infinitesimal supersymmetry transforma- 
tiondV =i(EO + EQ)V. We use the same kind of argument as before: 5([V] p) is 
linear in £ or £, which by dimensional analysis must multiply something with di- 
mension n + 3 since [£] = [£]= — 4. This can only be the derivative 0 of something 
with dimension n + 5 , namely the coefficients of 996 and 906 in the expansion of 
V. We conclude that 5([V]p) has to have the form 0,,(. . .), namely that 5([V]p) 
is a total divergence. This is the same type of argument that allows us to conclude 
that 5([®],-) is a total divergence. 

Thus, the action f d*x[@']p is invariant under supersymmetry. 

Staring at (8), which I repeat for your convenience, 


(y, 6) = g(x) + W260 W(x) + OOF (x) (10) 
+ i60"68,p(x) — 500"000°68,,0,9(x) + V20100"68,, (x) 


we see that f d*x[@'®], contains f d*xy*'d?y (from multiplying the first term 
in ' with the fifth term in ©), { d*xdy'dg (from multiplying the fourth term in 
* with the fourth term in ©) , f d*xya"d,,y (from multiplying the second term 
in ©' with the sixth term in ©), and finally f d+x FF (from multiplying the third 
term in ' with the third term in &). It is quite amusing how derivatives of fields 
arise in supersymmetric field theories: Note that the action { d *x[@'@] y does not 
contain derivatives explicitly. 

To summarize, given a chiral superfield } we have constructed the supersym- 
metric action 


$= f atx {tot ely + (Wr +he)) (11) 
Explicitly, with the choice W(®) = tm? + 4g? we have 


S= J d*x{ag'dg +iva"d,.w + F'F 
(12) 


— (Fo — zmyy t+ eFy — soy thc.) 
An auxiliary field 
From the very beginning the field F seemed strange. Since [ F] = 2 we anticipated 


that it cannot have a kinetic energy term with mass dimension 4 and indeed it 
doesn’t. We see that it is not a dynamical field that propagates—it is an auxiliary 
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field Gust like o in Chapter III.5 and ,, in Chapter VI.3) and can be integrated out 
in the path integral f DF*DFe'*. Indeed, collect the terms that depend on F in 
S, namely 


F'F — F(mg + gg”) — F'(mg' + gy"*) =|F — mg + gg”)? ~ [met g¢?/? 


So, integrate over F and F' and get 


S= [ atx{ag'ag + ia"a,u — my + a9? + dmyy ~ gov the) 
(13) 


Note that the scalar potential V(y', y) = |mg + gy’|? > 0 in accordance with 
(4) and vanishes at its minimum, giving a zero cosmological constant. Note that 
we are no longer free to add an arbitrary constant to V(g', y) as we could ina 
nonsupersymmetric field theory. 

As expected, supersymmetric field theories are much more restrictive than ordi- 
nary field theories, and, duh, also much more symmetric. The formalism described 
here can be extended to construct supersymmetric Yang-Mills theory. As men- 
tioned earlier, if any nontrivial 4-dimensional quantum field theory turned out to be 
exactly soluble, the supersymmetric N = 4 Yang-Mills theory (Exercise VIII.4.3) 
is probably our best bet. 

I hope that this brief introduction gave you a flavor of supersymmetry and will 
enable you to go on to specialized treatises. 


Exercises 


VII.4.1. Construct the Wess-Zumino Lagrangian by the trial and error approach. 


VUI.4.2. In general there may be N supercharges Q/, with J = 1, ..., N. Show that we 
can have {Q/, O35) = €ygZ'/, where Z'/ denotes c-numbers known as central 
charges. 


VIH.4.3. From the fact that we do not know how to write consistent quantum field theories 
with fields having spin greater than 2 show that the N in Exercise 2 above 
cannot exceed 8. Theories with N = 8 supersymmetry are said to be maximally 
supersymmetric. Show that if we do not want to include gravity, N cannot be 
greater than 4. Supersymmetric N = 4 Yang-Mills theory has many remarkable 
properties. 


VII4.4. Show that 36,/008 = egg. 


VII.4.5. Work out dy, dy, and SF precisely by computing 5 = i(E°Q,, + E. O%)o. 


VIl.4.6. 


VIil.4.7. 


VIIIL4.8. 
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For any polynomial W(®) show that [W(®)];- = F[dW(~)/dy]+ terms not 
involving F. Show that for the theory (11) the potential energy is given by 
Vig", 9) = |8Wy)/dgl*. 


Construct a field theory in which supersymmetry is spontaneously broken. [Hint: 
You need at least three chiral superfields. ] 


If we can construct supersymmetric quantum field theory, surely we can construct 
supersymmetric quantum mechanics. Indeed, consider Q, = 5[0,P + o,W(x)] 
and Q» = 3[0,P — o,W(x)], where the momentum operator P = —i(d/dx) as 
usual. Define Q = Q) +1 @Q>. Study the properties of the Hamiltonian H defined 
by {Q, Q"} = 2H. 


Chapter VIII.5 


A Glimpse of String Theory as a 
2-Dimensional Field Theory 


Geometrical action for the bosonic string 


In this closing chapter, I will try to give you a tiny glimpse into string theory. Need- 
less to say, you can get only the merest whiff of the subject here, but fortunately 
excellent texts do exist and I believe that this book has prepared you for them. My 
main purpose is to show you that perhaps surprisingly the basic formulation of 
string theory is naturally phrased in terms of a 2-dimensional field theory. 

In Chapter I.10 I described a point particle tracing out a world line given by 
X(t) in D— dimensional spacetime. Recall that the action is given geometrically 
by the length of the world line 


Xu dx 
S=—m f ae axe ee (1) 
dt dt 


and remains unchanged under reparametrization t — t'(t). Recall also that clas- 
sically, S is equivalent to 


1 1dx"dXx,, ) 
Sinp = —-— | dt | -——+t+ym 2 
imp >| Tt (- drt dt Y (2) 


Now consider a string sweeping out a world sheet given by X(t, 0) in D- 
dimensional spacetime, which we have already encountered in Chapter IV.4 in 
connection with differential forms. In analogy with (1), Nambu and Goto proposed 
an action given geometrically by the area of the world sheet 


SNG =T / dtdo ,/det(d,X"d,X_,) (3) 


where 0X" = dX" /dt, 0X" = dX" /do, and (0,X"0,X,,) denotes the ab ele- 
ment of a 2 by 2 matrix. Here, as in (1), jz ranges over D values: 0, 1,..., D—1. 
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The constant T (= 1/27ra’ with a’ the slope of the Regge trajectory in particle phe- 
nomenology) corresponds to the string tension since stretching the string to enlarge 
the world sheet costs an extra amount of action proportional to T. 

In a precise parallel with the discussion for the point particle, it is preferable to 
avoid the square root and instead use the action 


S= $T / dtday2y” (a,X"9pX,,) (4) 


with y = det y,, in the path integral to quantize the string. We will now show that 
S is equivalent classically to Sy. 

As in (2), we vary S with respect to the auxiliary variable y,,, which we then 
eliminate. For a matrix M, 5M! = —M~\($M)M~ and 4 det M = de™'8 @ — 
et log Miryy—18M = (det M)trM—18M. Thus, dy? = —y"“Sy,,y7? and by = 
yy"*6yq,. For ease of writing, define hg, = 8,X"4,X,,. The variation of the in- 
tegrand in (4) thus gives 


1 i 
bly? y hal =v gy 8¥caly Rab) — YS ¥ea¥™ hab 
Setting the coefficient of dy,, equal to 0 we obtain 


heg = 5 Vea (Yhap) (3) 


where the indices on A are raised and lowered by the metric y . Multiplying (5) by 
7° (and summing over repeated indices) we find y?°h,, =2 and thus y,. =A. 
Plugging this into (4) we find that S= T f dtdo (det h)2. Thus, S and Syg are 
indeed equivalent classically. The action (4), first discovered by Brink, Di Vecchia, 
and Howe and by Deser and Zumino, is known as the Polyakov action. 

Note that (5) determines y,,, only up to an arbitrary local rescaling known as a 
Weyl transformation: 


YaplT, a) > ery icc, a) (6) 
Thus, the action (4) must be invariant under the Wey] transformation. 

Staring at the string action (4), you will recognize that it is just the action for 
a quantum field theory of D massless scalar fields X“(t, a) in 2-dimensional 
spacetime with coordinates (t, 0), albeit with some unusual signs. The index 
uw plays the role of an internal index, and Poincaré invariance in our original 
D-dimensional spacetime now appears as an internal symmetry. Indeed, a good 
deal of string theory is devoted to the study of quantum field theories in 2- 
dimensional spacetime! It is amusing how quantum field theory manages to stay 
on the stage. 
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To this bosonic string theory we can add fermionic variables in such a way 
as to make the action supersymmetric. The result, as you surely have heard, is 
superstring theory, thought by some to be the theory of everything! 

This infinitesimal introduction to string theory is all I can give you here, but 
I hope that this book has prepared you adequately to begin studying various 
specialized texts on string theory.” 


1To understand macroscopic properties of matter based on understanding these mi- 
croscopic laws is just unrealistic. Even though the microscopic laws are, in a strict sense, 
controlling what happens at the larger scale, they are not the right way to understand that. 
And that is why this phrase, “theory of everything,” sounds sleazy.”--J. Schwarz, one of 
the founders of string theory. 

2 For a brief but authoritative introduction, see E. Witten, “Reflections on the Fate of 
Spacetime,” Physics Today, April 1996, p. 24. 
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As I confessed in the preface, I started out intending to write a concise introduction 
to quantum field theory, but the book grew and grew. The subject is simply too rich. 
As I mentioned, after a period of almost being abandoned, quantum field theory 
came roaring back. To quote my thesis advisor Sidney Coleman, the triumph of 
quantum field theory was veritably “a victory parade” that made “the spectator 
gasp with awe and laugh with joy.” 

String theory is beautiful and marvellous, but until it is verified, quantum field 
theory remains the true theory of everything. All of physics can now be said to be 
derivable from field theory. To start with, quantum field theory contains quantum 
mechanics as a (0 + 1)-dimensional field theory, and to end (perhaps) with, string 
theory may be formulated as a (1 + 1)-dimensional field theory. 

Quantum field theory can arguably be regarded as the pinnacle of human 
thought. (Hush, you hear the distant howls of the mathematicians, English pro- 
fessors, philosophers, and perhaps even a few stuck-up musicologists?) It is a 
distillation of basic notions from the very beginning of the physics: Newton’s re- 
alization that energy is the square of momentum appears in field theory as the two 
powers of spatial derivative. But yet—-you knew that was coming, didn’t you, with 
field theory set up as the pinnacle et cetera?—but yet, field theory in its present 
form is in my opinion still incomplete and surely some bright young minds will 
see how to develop it further. 

For one thing, field theory has not progressed much beyond the harmonic 
paradigm, as I presaged in the first chapter. The discovery of the soliton and 
instanton opened up a new vista, showing in no uncertain terms that Feynman 
diagrams ain’t everything, contrary to what some field theorists thought. Duality 
offers one way of linking perturbative weak coupling theory to strong coupling, but 
as yet practically nothing is known of the strong coupling regime. When speaking 
of renormalization groups, we bravely speak of flowing to a strong coupling fixed 
point, but we merely have the boat ticket: We have little idea of what the destination 
looks like. Perhaps in the not too distant future, lattice field theorists can extract 
the field configurations that dominate. 

Another restriction is to two powers of the derivative, a restriction going back 
to Newton as I remarked above. In modern applications of field theory to problems 
far beyond particle physics, there is no reason at all to impose this restriction. For 
example, in studying visual perception, one encounters field theories much more 
involved than those we have studied in this book. (See the appendix for a brief 
description.) These field theories are Euclidean in any case and the correspond- 
ing functional integral with higher derivatives certainly makes sense: It is only in 
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Minkowskian theories that we do not know how to handle higher derivatives. New- 
ton again-—-certainly economists consider the rate of change of the acceleration as 
well as acceleration. Another innovative application is the formulation! of a class 
of problems in nonequilibrium statistical mechanics as field theories. Typically, 
various objects wander around and react when they meet. This class of problems 
appears in areas ranging from chemical reactions to population biology. 

We can go far beyond the restriction on the number of derivatives in the 
Lagrangian. Who said that we can only have integrands of the form “exponential 
of a spacetime integral’? Most modifications you can think of might immediately 


_ fa tf areca 
run afoul of some basic principles (for example, { Dee fdixi@)-f a*xi@)] 


would violate locality), but surely others might not. Another speculative thought I 
like to entertain goes along the following line: Classical and quantum physics are 
formulated in terms of differential equations and functional integrals, respectively. 
But how are differential equations contained in integrals? The answer is that the 
integrals f Dye" f #2) contain a parameter fi so that in the limit / going to 
zero the evaluation of the integrals amounts to solving partial differential equations. 
Can we go beyond quantum field theory by finding a mathematical operation 
that in the limit of some parameter k going to zero reduces to doing the integral 
[Do eo hy f d*x£@)o 

The arena of local field theory has always been restricted to the set of d real 
numbers x“. The recent excitement over noncommutative field theory promises 
to take us beyond. (I was tempted to discuss noncommutative field theory too, but 
then the nutshell would truly burst.) 

But perhaps the most unsatisfying feature of field theory is the present formu- 
lation of gauge theories. Gauge “symmetry” does not relate two different physical 
states, but two descriptions of the same physical state. We have this strange lan- 
guage full of redundancy we can’t live without. We start with unneeded baggage 
that we then gauge-fix away. We even know how to avoid this redundancy from 
the start but at the price of discretizing spacetime. This redundancy of descrip- 
tion is particularly glaring in the manufactured gauge theories now fashionable 
in condensed matter physics, in which the gauge symmetry is not there to begin 
with. Also, surely the way we calculate in nonabelian gauge theories by cutting the 
Yang-Mills action up into pieces and doing violence to gauge invariance will be 
held up to ridicule a hundred years from now. I would not be surprised if a brilliant 
reader of this book finds a more elegant formulation of what we now call gauge 
theories. 

Look at the development of the very first field theory, namely Maxwell’s theory 
of electromagnetism. By the end of the nineteenth century it had been thoroughly 
studied and the overwhelming consensus was that at least the mathematical struc- 
ture was completely understood. Yet the big news of the early twentieth century 


By M. Doi, L. Peliti, J. C. Cardy, and others. See for example J. C. Cardy, cond- 
mat/9607163, “Renormalisation Group Approach to Reaction-Diffusion Problems,” in: 
J.-B. Zuber, ed., Mathematical Beauty of Physics, p. 113. 
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was that the theory, surprise surprise, contains two hidden symmetries, Lorentz in- 
variance and gauge invariance: two symmetries that, as we now know, literally hold 
the key to the secrets of the universe. Might not our present day theory also contain 
some unknown hidden symmetries, symmetries even more lovely than Lorentz and 
gauge invariance? I think that most physicists would say that the nineteenth-century 
greats missed these two crucial symmetries because of their lousy notation? and 
tendency to use equations of motion instead of the action. Some of these same peo- 
ple would doubt that we could significantly improve our notation and formalism, 
but the dotted-undotted notation looks clunky to me and I have a nagging feeling* 
that a more powerful formalism will one day replace the path integral formalism. 

Since the point of good pedagogy is to make things look easy, students some- 
times do not fully appreciate that symmetries do not literally leap out at you. If 
someone had written a supersymmetric Yang-Mills theory in the mid-1950s, it 
would certainly have been a long time before people realized that it contained a 
hidden symmetry. So it is entirely possible that an insightful reader could find a 
hitherto unknown symmetry hidden in our well-studied field theories. 

It is not just a matter of clearer notation and formalism that caused the 
nineteenth-century greats to miss two important symmetries; it is also that they did 
not possess the mind set for symmetry. The old paradigm “experiments — action 
—> symmetry” had to be replaced* in fundamental physics by the new paradigm 
“symmetry — action — experiments,” the new paradigm being typified by grand 
unified theory and later by string theory. Surely, some future physicists will remark 
archly that we of the early twenty-first century did not possess the night mind set. 

In physics textbooks, many subjects have a finished completed feel to them, 
but not quantum field theory. Some people say to me, what else is there to say 
about field theory? I would like to remind those people that a large portion of 
the material in this book was unknown 30 years ago. Of course, while I feel that 
further developments are possible, I have no idea what—otherwise I would have 
published it—so I can’t tell you what. But let me mention two recent developments 
that I find extremely intriguing. (1) Some field theories may be dual to string 
theories. (2) In dimensional deconstruction a d-dimensional field theory may look 
(d + 1)-dimensional in some range of the energy scale: the field theory can literally 
generate a spatial dimension. These developments suggest that quantum field 
theories contain considerable hidden structures waiting to be uncovered. Perhaps 
another golden age is in store for quantum field theory. 

So boys and girls, the parade is over, and now it’s up to you to get another parade 
going. 


7It is said, and I agree, that one of Einstein’s great contributions is the repeated in- 
dices summation convention. Try to read Maxwell’s treatises and you will appreciate the 
importance of good notation. 


3] once asked Feynman how he would solve the finite square well using the path integral. 
4 A. Zee, Fearful Symmetry, chap. 6. 
5 As the Beatles said, quantum fields forever! 
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Appendix 


An image presented to the visual system can be described as a 2-dimensional Euclidean field 
%o(x), with gg representing the gray scale from black (gy = —00) to white (gp = +00). [You 
can see that color might be included by going to a field ¢ transforming under some internal 
SO(2) group for example.] The image actually perceived, g(x), is the actual image gp(x) 
distorted to gp[y(x)] plus some noise n(x). Distortion is described by a map x > y(x) of 
the 2-dimensional Euclidean plane. Your brain’s task is to decide whether the actual image 
iS Yy(X) or some other g(x). Your ability to discriminate between images depends on the 
functional integral 


_ _ 9 2 
z= f dytx) f Dn(xye MP OHMOS Es oiyayl+ na) — oO} 
= [ dye MOH ONO J Peiveo-vlreor 


where for simplicity I have taken the noise, measured by the parameter C, to be Gaussian 
and white. The weighting function W[y(x)]is presumably hard wired by evolution into our 
visual system, telling us that certain distortions (translations, rotations, and dilations) are 
much more likely than others. Writing y(x) = x + A(x) we note that Z defines a field theory 
of the 2-component field A;(x), which can always be written as A; = 0;n + €;;0;x. Note 
that the field A;(x) appears “inside” an “external” field g). From symmetry considerations 
we might argue that 


I I 
Ww=- / d?x (ne% + x0) 
8? f? 
with two coupling constants f and g. I can give here only the briefest of sketches and 
refer the interested reader to the literature.° Clearly, one can think of other examples. This 
particular example serves only to show that there are many more field theories than those 
described in standard texts. 


6 W. Bialek and A. Zee, Statistical mechanics and invariant perception, Phys. Rev. Lett. 
58: 741, 1987; Understanding the efficiency of human perception, Phys. Rev. Lett. 61: 1512, 
1988. 
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Gaussian Integration and the Central 
Identity of Quantum Field Theory 


The basic Gaussian: 


+00 1.2 
/ dxe2* =VJ2n (1) 
—00 
The scaled Gaussian: 
ro0 12 Qn 2 
/ dxe72% = (=) (2) 
oo a 
Moments: 
1 
+00 z 
/ dxe-1t yg" — (=) S on —NQn—3 5-3-1 @) 
00 a qa" 


Gaussian with source: 


, 4 
+00 
/ dxe 72a tix = (=) ; ef Pa (4) 
—oo a 
i 
+00 . 2 
/ dxe~ sax +isx _ (=) et [a (5) 
—oo a 
ree Ziax?+idx ani 2 -iJ?/2a 
/ dxe2 = (=*) e€ (6) 
00 a 
1 
[~ [~ _ [- dxdx)---dxy er A*HI* — (=) © DEAS (7) 
-co J-o0 —00 det[ A] 
1 
[~ iz - [~ dxjdxy---dxy pode Axtdx _ (G) pitas (8) 
—oo J—oo —o0 det[A] 


In what follows, we omit an overall factor. 
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Central identity of quantum field theory: 
/ Dye 3? 0-VOT@ x g- VOB gh EK 
A trivial variation: 
Variations: 
/ Doel/D0-K-otiF-y =U /2)F-K--I 
/ Doe f 213 9@)Kew)+I@MeOl _ gi f dxl-}J@K VO) 


/ Doe f M249 OKo@+I@ ON — of 44x13 )K-IGD) 


(where K or K~! or both may be nonlocal) 
A specific example: 


/ Doe J MaDe +900 _ of f dxt-a2IGH) 


For K hermitean with g complex: 


—gl K. t t. heel, 
[ Det doe g' K-g+J' o+o Ja el KU 


(9) 


(10) 


(11) 


(12) 


(13) 


(14) 


(15) 


As noted earlier, various numerical factors have been swept under the integration mea- 
sure. In applying these formulas, be sure that these factors are not relevant for your purposes. 


Appendix B 
A Brief Review of Group Theory 


I give here a brief review of the group theory I will need in the text. I assume that you have 
been exposed to some group theory, otherwise this instant review will most likely not be 
intelligible. Most of the concepts are illustrated with examples, and it goes without saying 
that you should work out all the examples and verify the assertions made without proof. 


SOW) 
The special orthogonal group SO(N) consists of all N by N real matrices O that are 
orthogonal 
oTo=!1 (1) 
and have unit determinant 


det O = 1 (2) 


We denote the element in the ith row and jth column by O' . The group SO(N) consists of 
rotations in N-dimensional Euclidean space and its defining or fundamental representation 
is given by the N component vector 0 = {v/, j = 1,..., N}, which transforms under the 
action of the group element O according to 


vio vl = orl (3) 


We define tensors as objects that transform as if they are equal to the product of vectors. 
For example, the tensor T’/* transforms according to 


ilk > piiik _ o! osm om pin (4) 


as if it is equal to the product v' v/v‘. The emphasis is on the phrase “as if”: T'/* is not to 
be thought of as being equal to v' v/y*. 

It is important to develop some “feel” or intuition for groups and their representations. 
Some people find it helpful to picture a certain number of objects being acted upon by the 
group and transformed into linear combinations of each other. Thus, picture T/* as N? 
objects being scrambled together. 
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Tensors furnish representations of the group. In our particular example, each group 
element is represented by an N? by N? matrix acting on the N? objects T*. The number 
of objects in a tensor is called the dimension of the representation. 

It may well be that any given object in a representation does not transform, under all the 
elements of the group, into a linear combination of all the other objects, but only into a subset 
of them. Let me illustrate with an example. Consider T') > TJ = O# O/"7!™ | Form the 
symmetric SJ = 1(T) + T/") and antisymmetric combinations A! = 1(T'/ — TJ"), The 
symmetric combination S‘/ transforms into O/O/"S'™, which is obviously symmetric. 
Similarly, A‘! transforms into 0 O/" A'™, which is obviously antisymmetric . In other 
words, the set of N* objects contained in T’/ split into two sets: 1N(N + 1) objects 
contained in S/ and 3N(N — 1) objects contained in A, The S‘/’s transform among 
themselves and the A’/’s transform among themselves. 

The representation furnished by T‘/ is said to be reducible: It breaks apart into two 
representations. Obviously, representations that do not break apart are called irreducible. 

We just exploited the obvious fact that the symmetry properties of a tensor under 
permutation of its indices is not changed by the group transformation, namely that the 
indices on a tensor transform independently, as in (4). The various possible symmetry 
properties may be classified with Young tableaux, which is useful in a general treatment 
of group theory. Fortunately, in the field theory literature one rarely encounters a tensor 
with such complex symmetry properties that one has to learn about Young tableaux. 

Another way of saying this is that we can restrict our attention to tensors with definite 
symmetry properties under permutation of their indices. In our specific example, we can 
always take T'/ to be either symmetric or antisymmetric under the exchange of i and j. 

We have yet to use the properties (1) and (2). Given a symmetric tensor T'/ consider 
the combination T = 6//T/, known as the trace. Then T > 6/T" = S/O" OI"T'™ — 
(OT) si oimy'™ — gm T'™ — T, where we used (1). In other words, T transforms into it- 
self, We can subtract the trace from T’/ forming the traceless tensor Q') =T!/ — (1/N)8"T. 
The 1 N(N + 1) — 1 objects contained in Q’/ transform among themselves. 

To summarize, given two vectors v and w, we can form a tensor, and decompose the 
tensor into a symmetric traceless combination, a trace, and an antisymmetric tensor. This 
process is written as 


N@N=[3N(N + 1)- 11010 5N(N — 1) (5) 


In particular, for SO(3), 3®3=5 @ 19 3, a relation you should be familiar with from 
courses on mechanics and electromagnetism. 

There are two conventions for naming representations. We can simply give the dimension 
of the representation. (This can occasionally be ambiguous: Two distinct representations 
may happen to have the same dimension.) Alternatively, we can specify the symmetry prop- 
erties of the tensor furnishing the representation. For instance, the representation furnished 
by a totally antisymmetric tensor of n indices is often denoted by [n] and the representation 
furnished by a totally symmetric traceless tensor of n indices by {n}. Obviously, [1] = {1}. 
In this notation, the decomposition in (5) can be written as {1} @ {1} = {2} @ {0} @ [2]. For 
the group SO(3), with its long standing in physics, the confusion over names is almost 
worse than in reading Russian novels: For instance, {1} is also known as p and {2} as d. 
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123...N 


We have yet to use (2). Using the antisymmetric symbol ¢ , we write it as 
gill2---in Qfil Qi22 .. oin’ =] (6) 
or equivalently 
ghi2--tn oii giz | Oinin = eliha.-in (7) 


By multiplying (7) by O' repeatedly, we can obviously generate more identities. Instead of 
drowning in a sea of indices, let me explain this point by specializing to say N = 3. Thus, 
multiplying (7) by (O1)in*n we obtain 


ells Qi ghr = ih gQ! ysis 


Speaking loosely, we can think of moving some of the O’s on the left hand side of (7) to 
the right hand side, where they become O7’s. 

Using these identities, you can easily show that [n] is equivalent to [N — n]. For example, 
as is well known, in 5O(3) the antisymmetric 2-index tensor is equivalent to the vector. (The 
cross product of two vectors is a vector.) 

Any orthogonal matrix can be written as O = e4. The conditions (1) and (2) imply 
that A is real and antisymmetric, so that A may be expressed as a linear combination of 
N(N — 1)/2 antisymmetric matrices denoted by iJ‘/: O = e'®” J” (with repeated indices 
summed over). We have defined J'/ as imaginary and antisymmetric and hence hermitean. 
Since the commutator [J‘/, J*/]is antihermitean, it can be written as a linear combination 
of the iJ’s. 

Ironically, some students are confused at this point because of their familiarity with 
5 O(3), which has special properties that do not generalize to SO(N). 

In speaking about rotations in 3-dimensional space we can specify a rotation as either 
around say the third axis, with the corresponding generator J°, or as in the (1-2)-plane, 
with the corresponding generator J! = —J?!. In higher dimensions, for example 10- 
dimensional space, we can speak of a rotation in the (6-7)-plane, with the corresponding 
generator J®’ = —J”®, but it is nonsense to speak of a rotation around the fifth axis. Thus, 
to generalize to higher dimensions we should write the standard commutation relation [J!, 
J7|=is° for SOG) as [J23, J34] =i 7!?, which can be generalized immediately to 


ri, J] = i(sihyl! — pik! 4 gitpik — sity sky (8) 


The right hand side reflects the antisymmetric character of J‘/ = —J/', A potential confu- 
sion some students may have about the notation: J‘/ denotes a matrix generating rotation in 
the (i-j)-plane, a matrix with element (J‘/)* in the k-th row and /-th column. The indices 
i, j,k, and / all run from 1 to N, but the set {ij} and the set {k/} should be distinguished 
conceptually: The former labels the generator and the latter are matricial indices when the 
generator is regarded as a matrix. As an exercise, write down (J‘/)" explicitly and obtain 
(8) by direct computation. 

In studying group theory, as I have already remarked, one source of confusion comes 
from the fact that some of the smaller groups, which we tend to encounter first in our 
studies, have special properties that do not generalize. The special property of 5O(3) 
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we just noted is due to the fact that the antisymmetric symbol «‘/* carries three indices 
and thus J‘/ may be written as J* = ie/* JJ, For SO(4) the antisymmetric symbol 
e‘Jkl carries four indices and we can form the combinations 4(J‘/ + fe/"' 7), Define 
JleiJ?s 9, Raj t I, and 3 = 4(s? + J). By explicit computation, 
show that [J7 , Hy =islF IE [Ii, Ji = iellk J*, and [Ji J/]=0. This proves the well- 
known theorem that SO(4) is locally isomorphic to SO(3) ® SO(3). 

I assume that you know that §O (3) is locally isomorphic to SU(2). If you don’t, I give 
a brief review below. 

With a few i’s included here and there, these two results prove the statement that the 
Lorentz group §0(3, 1) is locally isomorphic to SU (2) ® SU (2), which we prove explicitly 
in Chapter 11.3. The Lorentz group can be thought of as an “analytic continuation” of the 
rotation group SO (4). 

One highly nonobvious result of group theory is that SO(N) contains representations 
other than vector and tensor. I develop the relevant group theory for the spinor representa- 
tions in Chapter VIL.7. 


SU(N) 


We next turn to the special unitary group SU(N) consisting of all N by N matrices U that 
are unitary 


U'u=1 (9) 
and have unit determinant 
det U = 1 (10) 


The story of SU (N) has more or less the same plot as the story of SO(N) with the crucial 
difference that the tensors of the unitary groups can carry both upper and lower indices. We 
denote the element in the ith row and jth column by U a the wisdom of this notation will 
soon become apparent. 

The defining or fundamental representation of SU(N) consists of N objects g/, j = 
1,..., N, that transform under the action of the group element U according to 


gi > op" =Uig! (11) 
Taking the complex conjugate of (11) we have 
; ; eh wf 
gp" > (U7) = (Uy (12) 


We invite ourselves to define an object we write as y; that transforms in the same way as 
g*; thus 


9 > gi = UY 9, (13) 


Note that we did not say that y; is equal to y*'; we merely said that y; and g*' transform in 
the same way. 
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As before, we can have tensors. The tensor gi , for example, transforms as if it is equal 
to the product y' g/g, : 


gt! > of! =UiuiUtyigl™ (14) 


Again, we emphasize that we did not say that gi! is equal to g' g/y,. (in some books ¢’ is 
called a covariant vector and g; a contravariant vector. A tensor g’’’"’’ with m upper indices 
and n lower indices is defined to transform as if it is equal to the product of m covariant 
vectors and m contravariant vectors.) 

The possibility of complex conjugation in SU(N) leads naturally to having indices 
“upstairs” and “downstairs.” Note that (9) can be written out explicitly as (U Dy U A = 3) and 
thus the Kronecker delta in SU(N) carries one upper and one lower index. It is important 
when taking traces that we set an upper index equal to a lower index and sum over them: 


ko = 9; , which transforms as 


for example, we can consider 5 j 


gy! > UjULU Vg" = Uj gin (15) 


where we have used (9). In other words, yf , the trace of gi , denote N objects that transform 
into linear combinations of each other in the same way as g'. Thus, given a tensor, we can 
always subtract out its trace. 

As in the discussion for SO(N), tensors furnish representations of the group. The 
discussion proceeds as before. The symmetry properties of a tensor under permutation of 
its indices are not changed by the group transformation. 

Another way of saying this is that given a tensor we can always take it to have definite 
symmetry properties under permutation of its upper indices and under permutation of its 
lower indices. In our specific example, we can always take gi to be either symmetric or 
antisymmetric under the exchange of i and j and to be traceless. Thus, the symmetric 
traceless tensor gi! furnishes a representation with dimension +N 2(N + 1) — N and the 
antisymmetric traceless tensor gi a representation with dimension 4N 2(N—1)-N. 

Thus, in summary, the irreducible representations of SU(N) are realized by traceless 
tensors with definite symmetry properties under permutation of indices. For example, 
in $U(5), some commonly encountered representations are g', g'/ (antisymmetric), ¢" 
(symmetric), yi ; gi! (antisymmetric in the upper indices) with dimensions 5, 10, 15, 24, and 
45, respectively. Convince yourself that for SU (4) the dimensions of the representations 
defined by these tensors are N, N(N — 1)/2, N(N + 1)/2, N? — 1, and 1N7(N -1)-N, 
respectively. 

The representation defined by the traceless tensor ¢ is known as the adjoint represen- 
tation. By definition, it transforms according to g; > gi =U}(U oh =Uj¢l (U De We 
are thus invited to regard ¢; as a matrix transforming according to 


g>¢’=Ugt' (16) 


Note that if g is hermitean it stays hermitean, and thus we can take g to be a hermitean 
traceless matrix. (If g is antihermitean we can always multiply it by i.) Another way of 
saying this is that given a hermitean traceless matrix X, UXU" is also hermitean and 
traceless if U is an element of SU(N). 
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As in the $O(N) story, representations of SU(N) have many names. For example, we 
can refer to the representation furnished by a tensor with m upper and n lower indices 
as (m,n). Alternatively, we can refer to them by their dimensions, with an asterisk to 
distinguish representations with mostly lower indices from the representations with mostly 
upper indices. For example, an alias for (1, 0) is N and for (0, 1) is N*. A square bracket is 
used to indicate that the indices are antisymmetric and a curly bracket indicate that the 
indices are symmetric. Thus, the 10 of SU/(5) is also known as [2, 0] = [2], where as 
indicated the 0 (no lower index) is suppressed. Similarly, 10* is also known as [0, 2] = [2]*. 

The condition (10) can be written as either 


Bits iyUpUs .. UN =1 (17) 
or 


e-ingly? | uN =] (18) 
2 in 

Thus, we have two antisymmetric symbols ¢; ;,..;,, and e//2---IN that we can use to raise 

and lower indices. Again, we can immediately generalize (17) to 


iy 17 in _ _ ; 
Pity... iyU U;, -- U;, = Fiya...N 


and multiplying this identity by (U iD yae and summing over jy we obtain 


iy y i2 in—| t 
Phig...pN UU; ° UL = 1 Fjip.. jyU it 
Clearly, by repeating this process, we can peel off the U/’s on the left hand side and put them 
back as U'*’s on the right hand side. We can play a similar game with (18). 
To avoid drowning in a sea of indices, let me show you how to raise and lower indices 
in a specific example Tather than in general. Consider the tensor ¢;! in SU (4). We expect 


that the tensor ¢j,4 = gre ijpq Will transform as a tensor with three lower indices. Indeed, 


Pkpg = Gi Eiipg > Eijpg U; UU yg” = = Elms UY’, (Uy (Uvip fm 
= (UY UY ONG nst 


As in SO(N) we can look at the generators of SU(N) by noting that any unitary 
matrix can be written as U = e'4, with H hermitean and traceless as required by (9) and 
(14). There are (N” — 1) linearly independent N by N hermitean traceless matrices T? 
(a=1,2,...,N*-1). Any N by N hermitean traceless matrix can be written as a linear 
combination of the T?’s and thus we can write U = e!°°T" | where 97 are real numbers and 
the index a is summed over. 

Since the commutator [T7, T? ]is antihermitean and traceless, it can also be written as 
a linear combination of the T@’s: 


[(T?, T°} _ if?T° (19) 
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(with the index c summed over.) The commutation relations (19) define the Lie algebra of 
SU(N), and f2°° are known as the structure constants. For SU (2) the structure constants 
f°¢ are simply given by the antisymmetric symbol 67°. 

Sometimes students are confused by how the generators act. Consider an infinitesi- 
mal transformation U ~ 1+ i@°T. On the defining representation, g' > Uigi wg + 
i A(T)! . Thus, the ath generator acting on the defining representation gives T°y. Now 
consider the adjoint representation (16) 


o> o ~ (1+ i8°T)g(1 + i0°T2)' ~ yg + i0°T%y — gid*T? = 9 + iO [T, 9) 
(20) 


In other words, the ath generator acting on the adjoint representation gives [T°, g]. Perhaps 
some students are confused by the fact that y is used as a generic symbol to denote different 
objects. 

Since the adjoint representation ¢ is hermitean and traceless it can also be written as a 
linear combination of the generators, thus ¢ = g’ T°. Using (19) we can thus also write (20) 
as gf > yo ~ gf — f%°6%y. In particular for SU (2), the three objects y" transform as a 
3-vector. (Note the notation: ¢? is not to be confused with g! :in SU (2) the index a = 1, 2,3 
while i = 1, 2.) 

This last remark essentially amounts to a proof that $U(2) is locally isomorphic to 
SO(3), a proof which I will now give. Any 2 by 2 hermitean traceless matrix can be written 
as X = xX -@, where a denote the Pauli matrices. Impose the condition .X 2 — 2, which is 
equivalent to x” = 1, so that x is a unit vector. For any element U of SU(2), X'=UXU Tis 
hermitean and traceless so that we can write X’ = x’. &. Since X? = trX? the unit vector 
X is rotated into the unit vector x’. Thus, we can associate a rotation with any given U. Since 
U and —U is associated with the same rotation, this gives a double covering of SO (3) by 
SU(2). 

Once again, the two special unitary groups that most students learn first, namely SU (2) 
and $U(3), have special properties that do not generalize to SU(N), just as SO(3) has 
special properties that do not generalize to SO(N), possibly leading to confusion. 

For SU (2), because the antisymmetric symbol «’/ and ¢;; carry two indices, it suffices 
to consider only tensors with upper indices, all symmetrized: We can raise all lower indices 
of any tensor by contracting with e” repeatedly. After this is done, we can remove any pair 
of indices in which the tensor is antisymmetric by contracting with ¢;,. 

In particular, g; = ¢;;¢/ , which can be stated equivalently in terms of a special property 
of the Pauli matrices 


070707 = —Gy (21) 
so that 
o> eis 0 = eibe (22) 
For SU (2) (11) becomes 
gi > o! = (2) gl 
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Complex conjugating, we obtain 

ot > (() To" = [(—ione (ian) I." 
and so 


ide 


bong” > e'”" (iang") 

We learn that io.y* transform in the same way as y. Recall that we define g; to transform 
in the same way as y™'. Thus, ey; transforms in the same way as g’. In the jargon, SU (2) 
is said to have only real and pseudoreal representations, but not complex representations. 
A pseudoreal representation is equivalent to its complex conjugate upon a similarity trans- 
formation. Recall that (21) figures into our discussion of charge conjugation in Chapter II.1 
and of the Higgs doublet in Chapter VII.2. 

For SU (3) it suffices to consider only tensors with all their upper indices symmetrized 
and all their lower indices symmetrized. Thus, the representations of SU (3) are uniquely 
labeled by two integers (m,n), where m and n denote the number of upper and lower 
indices. The reason is that the antisymmetric symbols ¢'/* and ¢;;, carry three indices. We 
can always trade a pair of lower indices in which the tensor is antisymmetric for one upper 
index, and similarly for upper indices. 

You can see easily that these special properties do not generalize beyond SU (2) and 
SU (3). 


Multiplying representations together 


In a course on quantum mechanics you learn how to combine angular momentum. We 
have already encountered this concept in (5), which when specialized to $O (3), tells us 
that 3@3=5 @ 163, as we noted. This is sometimes described by saying that when we 
combine two angular momentum L = | states we obtain L = 0, 1, 2. Students are justifiably 
confused when this procedure is also known as “addition” of angular momentum. 

Given two tensors g and 7 of SU(N), with m upper and n lower indices and with m’ 
upper and n’ lower indices, respectively, we can consider a tensor T with (m +m’) upper 
and (n + n’) lower indices that transforms in the same way as the product yr. We can then 
reduce T by the various operations described above. This operation of multiplying two 
representations together is of course of fundamental importance in physics. In quantum 
field theory, for example, we multiply fields together to construct the Lagrangian. 

As anexample, multiply 5* and 10 in SU (5). To reduce T;! = y,n'! we separate out the 
trace y,n") (which transforms as a 5) after which there is nothing more we can do. Thus, 


5*@10=5@45 (23) 


As another example, consider 10 @ 10: y’/ 7". It is easiest to write 7 equivalently as a 
tensor with three lower indices &,.4)11". The product 10 ® 10 then carries two upper and 
three lower indices and we will write it as T,”,,. Taking traces, we separate out T,, p> Which 


we recognize as 5*, and the traceless part of T' , which we recognize as 45* (see above), 


mny? 
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thus obtaining: 
10 @ 10 = 5* @45* @ 55* (24) 
As exercises you can work out 
5@5=10@ 15 (25) 
and 
5@5*=1624 (26) 


You should recognize the 24 as the adjoint. 

In physics we are often called upon to multiply a tensor by itself. Statistics then plays 
a role. For instance, SU(5) grand unification contains a scalar field g' transforming as 5. 
Because of Bose statistics, the product g’g/ contains only the 15. 


Restriction to subgroup 


To explain the next group theoretic concept, let me take a physical example. The SU (3) of 
Gell-Mann and Ne’eman transforms the three quarks uv, d, and s into linear combinations 
of each other. It contains as a subgroup the isospin SU (2) of Heisenberg, which transforms 
u and d, but leaves s alone. In other words, upon restriction to the subgroup SU (2) the 
irreducible representation 3 of SU (3) decomposes as 


33201 (27) 


Consider an irreducible representation with dimension d of some group G. When we 
restrict our attention to a subgroup #7, the set of d objects will in general decompose into 
n subsets, containing d), d,,...,d, objects, such that the objects of each subset only 
transform among themselves under the action of H. This makes obvious sense since there 
are fewer transformations in H than in G. 

The decomposition of the fundamental or defining representation specifies how the 
subgroup H is embedded in G. Since all representations may be built up as products 
of the fundamental representation, once we know how the fundamental representation 
decomposes, we know how all representations decompose. For example, in SU (3) 


3@3*=861 (28) 
while in SU (2) 
QEehvpve2Qg)=3be)e202e1 (29) 
Comparing (28) and (29) we learn that 


8> 3016262. (30) 
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Alternatively, we can simply look at the tensors involved. Consider g’ of SU (3) where 
the index i takes on the value 1, 2, 3. Let the index yz takes on the value 1, 2, Obviously, 
gy! = {¢", y*} corresponds to an explicit display of (27). Then gi = {¢", @, oe , 93}, where 
the bar on ¢* is to remind us that it is traceless. This corresponds precisely to (30). 

Actually, SU (3) also contains the larger subgroup SU (2) ® U(1), where the U(1) is 


generated by the traceless hermitean matrix 


—1 0 0 
0 -1 0 
0 0 2 


We can then write (27) as 3— (2, —1) @ (1, 2), where the notation is almost self- 
explanatory. Thus, (2, —1) denotes a 2 under SU (2) with “charge” —1 under U(1). 

In the text, we will decompose various representations of SU (5) and SO (10). Everything 
we do there will simply be somewhat more elaborate versions of what we did here. 
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Feynman Rules 


Here we gather the Feynman rules given in various chapters. 


Draw all possible diagrams. Label each line with a momentum. If applicable, also label 
each line with an incoming and an outgoing Lorentz index (for a line describing a vector 
field), with an incoming and an outgoing internal index (for a line describing a field 
transforming under an internal symmetry), so on and so forth. Momentum is conserved 
at each vertex. Momenta associated with internal lines are to be integrated over with the 
measure {[d* p/(27)*]. A factor of (— 1) is to be associated with each closed fermion loop. 
External lines are to be amputated. For an incoming fermion line write u(p, 5) and for an 
outgoing fermion line #(p’, s’). If there are symmetry transformations leaving the diagram 
invariant, then we have to worry about the infamous symmetry factors. Since I don’t trust 
the compilations in various textbooks I work out the symmetry factors from scratch, and 
that is what I advise you to do. 

As I have said repeatedly, computational prowess is not the point of this book. Thus my 
compilation of the relevant Feynman rules is certainly not exhaustive. 


Scalar field interacting with Dirac field 


_ l x _ 
L=wpliy"d, —my + 5 Lae)” — 29") - ne + foyy (1) 
Scalar propagator: 
k i 
2 
TS ee _ ne k? — p? +ie @) 


AT] 
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Scalar vertex: 


Scalar fermion vertex: 


Initial external fermion: 


Final external fermion: 


Initial external antifermion: 


Final external antifermion: 


u(p, S) 


u(p, S) 


v(p, $) 


v(p, $) 
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i . ptm 


ne EE 
p-mtie p*—m+ie 


Vector field interacting with Dirac field 


L=pliy"(a, —ieA,) — my — 4 FyyFYY — 57 A, AY 


Vector boson propagator: 


(3) 


(4) 


(S) 


(6) 


7) 


(8) 


(9) 


(10) 


(1) 


Feynman Rules 


Photon propagator (with € an arbitrary gauge parameter): 


k 
i k uky 
SLD DUP ip ja 5 - by 
Vector boson fermion vertex: 
UL 
iey" 

Initial external vector boson: 

Ey, (k) 
Final external vector boson: 

E,,(k)* 

Nonabelian gauge theory 


Gauge boson propagator: 


k ; kk 
i 
DPLDLL LPL PPP PPD k2 ja —§) ez 7 én] Sab 
Ghost propagator: 
k i 
mee eee eee p-------- pw 


Cubic interaction between the gauge bosons: 


af gy ylki — kaa 


+ By, (ky — ks)y + Bap (ks — Avy] 
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(12) 


(13) 


(14) 


(15) 


(16) 


(17) 


(18) 
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Quartic interaction between the gauge bosons: 


ap byv 
_ ig [fore f(g yr 8up ~ Sup8Bvad 
+ fe £815 Buy ~ BuvBpa) (19) 
d CA + £2 f° (ByBrp ~ Sup8vr)1 
ip 
Gauge boson coupling to the ghost field: 
GH 
g fer pt (20) 
oom se 
PB’ pe 
ze. an 


For your convenience I also list the formulas for computing cross sections and decay rates. 
Given the Feynman amplitude M for a process py + po > ki tho +---+hk, the 
differential cross section is given by 


_ 1 5h ak, 
|B) — HIECPYE(P») AmPE(y) Aa RE(k,) 


(28 Cp + pr — D> ky)IM? (21) 


i=] 


Here ¥, and vz denote the velocities of the incoming particles. The energy factor €(p) = 
2,/ p? + m for bosons and E(p) = «fp? + m?/m for fermions come from the different 
normalization of the creation and annhilation operators in Chapters 1.8 and II.2. If there are 
n; identical particles of type i in the final state, the statistical factor § = I; [1/n;!] accounts 
for indistinguishability. 

For a decay of a particle of mass M the differential decay rate in its rest frame is given 
by 


1 Pk, Bk, 


= oo om i om 45(4) “SE 9 
IM Gaye) Gmeay ms OP LIMP 22) 


i=l 


dT 
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Various Identities and Feynman Integrals 


Gamma matrices 


Identities for the trace of a product of an even number of gamma matrices: 
tryty" = 4n#" (1) 
ryt y yr y? = 4 (gti? — hn’? +n”) (2) 


We define the totally antisymmetric symbol e#”'* by ¢9!23 = +1 (note &9)93 = —1). Then 
with our definition y> =iy®°y'!y?y3, we have 


tryyty yy? = —4ieh? (3) 


Identities that follow from the basic Clifford identity: 


v" By, = —2 (4) 
Y" PR AY, =4P°4 (5) 
V"* PAY, = 27d B (6) 


I leave it to you to derive these identities. For example, to obtain (4) keep moving y* to the 
right in the expression y* py, = (2p* — py")y, =2 p—4 p=—2 p. 


Evaluating Feynman diagrams 


Over the years, a number of tricks and identities have been developed for evaluating the 
integrals associated with Feynman diagrams. 
Let us evaluate 


_ f dtk 1 _f ak f dky 1 
JS Qn) =m +isy J xP J lm [kB — 2 + m2) + iP 


Focus on the kg integral. Draw where the poles are in the complex kg-plane and you will see 
that the integration contour can be rotated anticlockwise so that [we denote the integrand 


by f (ko)] 
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+00 +100 +00 
| ahoftha) =f aka fa) =i fda f ike a) 


oO —~1XO —OX 


where in the last step we define ky = ik, (corresponding to the Wick rotation mentioned in 
Chapters I.2 and V.2.) Thus, 


dik 1 


— (133 
Tat | Gay + mp 


where d‘k is the integration element in Euclidean 4-dimensional space and k2. = k? + k? 
the square of a Euclidean 4-vector. The infinitesimal ¢ can now be set equal to zero. We can 
integrate immediately over the three angles since the integrand does not depend on them. 
You can look up the angular element in Euclidean space in a book, but we will use a neat 
trick instead. 

I will do the more general d-dimensional integral H = f d7kF (k?), where k? = k? + 
ki +---+k4 and F can be any function as long as the integral converges. (I now drop 
the subscript £; the context makes clear that we are in Euclidean space.) We can of course 
set d equal to 4 at the end. The result for arbitrary d will be useful to us in regularizing 
dimensionally (Chapter III. 1). 

We imagine integrating over the (d — 1) angular variables to obtain H = C(d) ts” dk 
k¢-1 F(k?). To determine C(d) we will do the integral J = f dtke7 2” in two different 
ways. Using (1.2.8) we have J = (./27r)?. Alternatively, 


Oo Oo 
J=C(d) i dk k4-1¢e-2" — C(d)23-} [ dx xtle-* = cain 
0 0 


where we changed integration variables and recognized the integral representation of 
the gamma function ['(z + 1) = ts dx x’e~*. (Recall that upon integration by parts we 
obtain ['(z + 1) = zI'(z), so that [(n) = (n — D! for n an integer.) Therefore C(d) = 
2n4/? 1 T(d/2) and 


| a‘kF (k*) = an" [ * dk KE 2) (8) 
P(d/2) Jo 


Setting d = | in (8) we determine ['(4) = 72, and setting F(k) = 8(k — 1) we see that 
the area of the (d — 1)-dimensional sphere is equal to C(d), thus recovering various results 


you learned in school about circles and spheres: C(2) = 27 and C(3) = 47. 
The new result you need as a budding field theorist is ford = 4: 


/ aKF (k*) =n? | dk*k? F (i?) (9) 


So finally we have 


—tI 2,2 1 —t 1 
I=— je —_—___ = —___ 10 
162 i (k2 4+ m2)3— 16m2 2m? (10) 
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We have derived the basic formula for doing Feynman integrals: 


d‘k 1 _ =i 
(27)4 (k2 — m2 +ie)3 3202m? 


(il) 


(With the telltale ig we have evidently moved back to Minkowski space.) As an exercise 
you can go through the same steps to find 


A d*k 1 i A? 
Rt gg f 2)... 12 
(2x)* (k* — m2 + ie)? va [8 ($5) + (12) 


Here a cutoff is needed, which we introduce by setting the upper limit in the integral over 
k2 in the analog of (10) to A?. As a check, differentiate (12) with respect to m* to recover 
(11). As another exercise show that 


A dk k —i 
——_—s = — 2m?) 13 
(27 )4 (k2 —m2+ie) 162 [a? m (5 ;) tm + | m9) 


In (12) and (13) (- - -) denote terms that vanish for A? >> m2. In some texts, the (—1) in (12) 
is dropped by absorbing it into A?. But then we have to be careful to adjust (13) accordingly 
if it appears in the same calculation. 

A useful identity in combining denominators is 


= (n — of [- f dada .. 
XX). Xp 


= 1 
~ Soa; | ———__—_—___ (14) 
+ Xn)" 


F (ajxy + Go%.+°°- 
For n = 2, 
1 
——_ i da——_—_1___ (15) 
xy Jo [ax+(l—a)yP 


and for n = 3, 


1 1 1 1 1 
—=2f [ [ dadBdyS{a + B+ ¥ — 1). 
XYZ 0 Jo Jo p Pty (ax + By + yz)3 


1 
=? ee 
/ | a [Z+a(x—z)+ Bly _ z)P 


(16) 


where the integration region is the triangle in the a-8 plane bounded by 0 < £ < 1— a@ and 
O<a<l. 
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Dotted and Undotted Indices and the 
Majorana Spinor 


We develop the dotted and undotted notation introduced in Chapter IT.3 for further use in 
discussing supersymmetry in Chapter VIII.4. In essence, the appearance of undotted and 
dotted indices can be traced back to the fact that the Lorentz algebra $O(3, 1) is locally 
isomorphic to SU(2) x SU(2). The absence or presence of the dot allows us to keep track 
of which SU (2) we are talking about. 

Here I will use results from Chapter II.3 and from the exercises (do them!) there 
extensively without bothering to write them down again. 


Knowing that 
0 ot 
yt= (Z. 0 ) (1) 


(1) 


we see that o” and o” carry indices as follows: 


acts on 


(o")o, and (aH)% (2) 


This is consistent with what you know, that the Lorentz vector transforms as G , 3) and 
thus straddles the two SU (2)’s. The matrices o/ and a mix dotted and undotted indices. 

Let us check that the Lorentz transformation property of the Dirac spinor W is consistent 
with what was discussed in Chapter IE.1. There we learned that ¥ > e~@/#@uwE"" w where 
De = (i /2)[y", y”]. (We want to use the symbol o”” for some other quantity and hence 
the change of notation.) Using (1) we obtain 


EH — 9} (> 0 ) 
QO ag? 


where o#” = i(o#e” —o's") anda" = i(GHo” — o’o). From (2) we see that these 
two matrices carry indices as follows 


(oH)? and GY) (3) 
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Again, this reflects the fact that the antisymmetric tensor (such as the electromagnetic field 
F,,,) transforms as (1, 0) + (0, 1). Thus, under an infinitesimal Lorentz transformation 


Va > E+ 50,0 go we (4) 
and 
Ko > I+ f@pyo"’)® gx? (5) 


You should check that it all works out according to plan. Everything is consistent with what 
we learned in Chapter 11.3, in particular, that boosts act oppositely on (4, 0) and (0, 4). 
Thus far, on the spinor fields y, and x“, the dotted indices always live upstairs and the 
undotted indices downstairs. What would get them to change floors? Charge conjugation. 
Recall from Chapter II.1 that the charge conjugated field is defined by ¥“ = CW [where 
T denotes transpose, ¥ means Wty, and C~!y#C = —(y#)"]. In the Weyl basis, we 
can choose C = ty°y?. The condition (¥°)° = W implies |¢| = 1. We choose ¢ = —i. 


Explicitly, 
v= ( ioax” ) 
—lo7py 


We now introduce some notation the wisdom of which will soon become clear. Given 
yw, and x, define 


a= We) and x7 = (X*)* (6) 


Weird, complex conjugation puts on a dot and a bar. 
We raise and lower undotted indices as follows: wy = ap? and yf = 6 y, which 
implies that ¢,,6°” = 6,7. Thus, if we choose 


0 1\_. 
ae o) = oder 


then 


0 -l 
By _ = (—io5)8Y 
7 (; >) (—to2) 


We are forced to define ¢,> and ¢!* to be opposite in sign, a fact to keep in mind. 
You should realize by now that what we are doing can again be traced back to that 
peculiar fact about Pauli matrices (see Appendix B) that (f02)0*(—io7) = —9;. 
Analogously, we raise and lower dotted indices as follows: 


Ye = e,av? and yh = cP yy 


Referring to (6) we see that €, 4 is numerically the same as &), and eFY is numerically the 
same as FY, 


\ 
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You now see that the rationale of these apparently capricious choices is so that we can 


now write 
x 
wea] 4 7 
(js) o 
Referring to 
v=(¥5) (8) 
x 


we see that the point of the notation is that y, and x, transform in the same way and are 
the same kind of creature (and similarly for x°% and y*). 

We now come to the all important concept of a Majorana spinor. A brilliant physicist, 
Ettore Majorana mysteriously disappeared early in his career. Fermi supposedly described 
Majorana as “‘a towering giant without any common sense.”! 

Given a Dirac spinor WV, if ¥ = V°, then V is said to be a Majorana spinor. Comparing 
(8) and (7) we see that a Majorana spinor has the form 


An obvious remark but a handy mnemonic: Given a Weyl! spinor yy, we can construct a 
Majorana spinor, and given two Weyl spinors we can construct a Dirac spinor. “One Weyl 
equals one Majorana, and two Weyls equal one Dirac.” 

Incidentally, another way of seeing that complex conjugation puts on a dot is that (see 
Chapter II.3) conjugation interchanges J +iKk and J —ik and hence the two SU (2)’s. 

The point of the notation is similar to that of the covariant and contravariant (or upper 
and lower) indices in special and general relativity. We always contract an upper index with 
a lower index. Here we have the additional rule that an undotted upper index can only be 
contracted with an undotted lower index, but never with a dotted lower index, (obviously 
since they belong to different groups.) It is easy to verify these rules. For example, using 
(4) we find 


nt > nf = Fy, = 6° (2) 91 gy = oF (087) ge, gM? = (0727 )* gn? (10) 


where we used once again (—io>)o;(@07) = —o} , and thus 7° yy, is indeed invariant (since 
1 Wray > (ew 227" )T (03? yp = ny). 

In special and general relativity we raise and lower indices with the metric, which is of 
course symmetric. Here we raise and lower indices with the antisymmetric ¢ symbol and 
as a result signs pop up here and there. For example, ny, = en Ba = 7 a(—eF* ary = 
—ngwe. Contrast this with the scalar product of two vectors v“w, = v,,w". if we want 
to suppress indices and write ny we must decide once and for all what that means. The 


'M. Gell-Mann, private communication. Incidentally, the name Ettore corresponds to 
Hector in English. 
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standard convention is to define 
ny =n Wy (11) 


and not ng yw. This rule is sometimes stated by saying that in contracting undotted indices 
we always go from the northwest to the southeast, and never from southwest to northeast. 
As we learned in Chapter IL.5 spinor fields are to be treated as anticommuting Grassmann 
variables under the path integral so that —ngy? = yb ng. We end up with the nice rule 
ny =n. 

Similarly, we define 


x§ = gk" = Ex (12) 


In contracting dotted indices we always go from southwest to northeast. 

Just as in special and general relativity, where the upper and lower indices are very 
useful in telling us whether expressions we write down make sense or not, the undotted 
and dotted upper and lower indices allow us to see immediately that ny and no4’y 
make sense, but no“ does not. [Look at (2) and (3) and see the kind of indices that 
appear.] The notation of course just codifies in a convenient way the group theory fact 
that (3, 0) ® (4, 0) = (0, 0) @ (1, 0), namely that out of two Weyl spinors we can make a 
scalar and a tensor but not a vector. 

As always, notation should be driven by physics and by computational convenience 
(which is intimately connected to elegance). 

To gain familiarity with the dotted and undotted 2-component notation you should work 
out some of the identities in the exercises. These identities are useful when working with 
supersymmetric field theories. 


Exercises 
E.1. Show that no#’y = —yo'n and xo" = —Yo"X. 


E.2. Show that (99)(x&) = — 3 (00"§)(XG,,9). 


E.3. Show that 9°%9, = 5 (00) 54. [Hint: Simply evaluate the two sides for all possible 
cases. ] 


Solutions to Selected Exercises 


PartI 


1.3.1. From the text we have for x° = 0, 


3 ard 
D(x) = ~i / at he 


ee 

(2m)32/ k2 + m? 

i © dkk? ft! 
~ 2Qmry Jo Sf 4m? J 


ikr etkry —_ 1 [ dkk eikr 
0 


8x27 Jk? + m2 


d(cos Oye thr cos @ 


4 t dkk_¢ 
2(27)2r 0 Vk? + m? 


kr 


=e. dk i 
~~ Sr Ar Joo Vee + m2 


The integrand in J = [°° (dk/v k2 + m2)e!*" has a cut along the imaginary axis 
going from im to ico (and another cut we don’t care about.) So fold the contour 
around the cut and change variable to k = i(m + y): 


1 
Vy +m)? — m? 


[e,2] 
r=2f dye W+mr 
0 


oO mn 1 

=2f due" —__ 
l Vu2—1 
oO 


=? [ dte—™ cosh t 
1] 


At this point you can look in a table and find that this is some Bessel function 
and read off the large r behavior, but it is more stylish to press on and descend 
steeply: we obtain 
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: fore] 
D(x) =- in dt (cosh rye" “sh! 
4n?r Jo 


—_ im ah d(sinh the mr cost 
4n?r Jo 
—mr V2 = fe dso Ut 5 s@ 
- 7 r 
mm (tte m, 
~ Ag? 2(mr)> 


using the Gaussian integral from the appendix of Chapter I.2. 


1.3.2. We evaluate 


a7k eikx 


Dix) = | —{_* 
(*) Qny2ke— m2 +ie 


by contours as in the text and obtain 


ak ; 
D(x) =-i —i(et kG (40) 4 pilex? g (x9 
(x) if Ondo," (x")+e (-x°)] 


For x° = 0, we recognize the integral 


+00 ; 
D(x) =-i / ak its 


—co (Qr)2Vk2 + m? 


as a Bessel function from exercise [.3.1: 
D(x) = = Ko(mmix|) ey 
2m\x| 


with the expected exponential decay for large x. 


1.7.2. Expanding and keeping only the desired terms 


4, 74 b ° b . 
ZU) >e [1+ rT (-i») [fe wiawa Enon law | 
i E ff d‘xd*tyJ(x)D — yoy) 
6! 2 


Just keep on differentiating. 


L7.4. Write ky = (Ve + m2, 0,0, k) and ky = (/k2 + m2, 0, 0, —k). Then E = 
2/k2 + m? > 2m. Physically, a pair of mesons can be produced when E > 2m. 


1.8.2. Clearly, only the terms aa‘ and a‘a in H contributes to < k’'|H|k > . Extract these 
two types of terms in 


1.8.4. 
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/ d? x(x)? 


“JI 7m 
(27) 2a, ,f (21)? 2a,) 
[a (Gat Ge tat 42 gi ogt-F'-¥) +hic.] 


dq => + > + => > 
_ / o 4 fa Ga" (g) + a4 Gag)! 
20,4 


and so H is for our purposes effectively equal to f dq =2[a(g)a'(@) +.a'@) 
a(q)], which upon using the commutation relation is equal to f dq 74 [5)(0) + 
2a" (q)a(q)]. We recognize the first term as the vacuum calculated in the text. 
Note that the definition of the delta function (21)? 8) (k) = f d?xe'** implies 
§)(0) = [1/(27)”] f d?x = V/(27)”. Thus, subtracting off the vacuum en- 
ergy, we have H effectively equal to f d°qu,a'(q)a (q), which just says that a 
mode of momentum gq carries energy w,. In particular, using the commutation 
relation twice we have < k'|H|k >= 8)’ — )a,. The energy of a particle of 
momentum & is «, relative to the vacuum. 


Q = f d?xJo(x) = f d°x(y'idpy — i(ayy )¢). Focus on the first term: 


Joffe 
(27)? 2a, / (27)? 2a, 

[at (Rel @ut-#3) + bein yyy fake tout kD _ bt eiout-k 2) 

Note that id) brings down a factor of w, and produces a relative sign between 


a and bt. As in Exercise 1.8.2 the integral over x produces a delta function that 
collapses the two k integrals into one, giving 


/ ak (a @a(k) — (bt (® — at(—kybt Re" +. b(—k a De!" 


The second term in —i(4g")g) in Jo(x) is just the hermitean conjugate of the 
first term g'idy. Thus, adding the hermitean conjugate of what we have just 
obtained, we find 


QO= / dP k[a’ (kak) — bb (k)] 


_ | dP kat ak) — bib) +8) J dk 


The infinite additive constant is to be subtracted out much like the vacuum 
energy. In some texts a normal ordering operation, denoted by a pair of colons, is 
defined as follows: If you see : (- . -) : you are instructed to move all the creation 
operators in the expression (---) to the left of the annihilation operators. In 
other words, by fiat : b(k)bt(k) : = bt(K)b(k). The current is then defined by 
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J(x) =: (vii On.~ —i (3,,p")g) :. Since the normal ordered current differs from 
the naively defined current by a c-number the most crucial property of the current, 
namely current conservation 0,,/*% = Q, is not affected. This is of course just a 
formal way of saying that the value of the charge in the vacuum state is to be 
subtracted. In any case, the result Q = f d?k [at(k)a(k) — b*(K)b(k)] shows that 
a and b annihilate positive and negative charges, respectively. 


(0| T[A(x)A (O)] (0) = O(x°) 0} e!?*A(O)e#?* AO) [0) 
+ 6(—x°)(0| A(O)e!?* A(O)e 7? |0) 
and 


(0| e?*A(0)e/?* A(O) [0) = ) | 0] AC) |n) (ne? * A(0) [0) 
—_ > ett AG? 


where Ao, = (0] A(Q) |n). Using the representation of the @ function and integrat- 
ing, we get 


/ d*xe!% (0| T(A(x)A(O)) |0) 


1 


OG + | 
Po +q°— is 


. 1 - 3 
=-i (27r)° > | Aon |” Pak ~ P,) + 
n 


n 


and thus 
Im { / d*xe'I* (0 r(Aacs)acoiioy} 


= m2) >> ton 18g — Pp) + 8G + Prd] 
4 
which is precisely 3} f d+xe!%*(0|[A(x), A(O)]|0) upon inserting 1 = 
> |n)(n| and integrating. Note that for g° > 0 the term involving 8 (g + P,) 
drops out. 


. We have (repeated indices summed) 


Roa! Ropi Day (x) -| DeRaapa'(*) Ry py (O)e"* 


But we can change the integration variable from g to Rg. Since the action S 
and the measure Dg are both invariant under SO(N) rotations, this is equal 
to f Dega(x)g,(O)e'> = i Dg, (x). Thus, we obtain Dj, = Rag: Rpp- Day. The 
properties of the rotation group are such that the only solution of this equation is 
Dap proportional to 8,,. 


1.9.3. 


1.10.2. 
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The field g transforms as a symmetric traceless tensor (see Appendix B) under 
SO(3), that is, with all indices displayed, 9,4 > Rag Roy Pay = Raa'Pa'r' Ry, = 
(RgR'),». As suggested in the hint, writing g as a 3 by 3 symmetric traceless 
matrix we have g > RgR? and thus the invariants are (up to quartic order in 
g) tr(3,9)", tr g”, try’, and (tr y”)?. Remarkably, you can prove that tr y* and 
(tr y?)? actually amount to only one invariant by diagonalizing 


a 0 0 
g={|0 8B 0 
0 0 -(a@+ 8) 


You can see by computation tr g? and (tr g”)? are both proportional to [a? + 
B*’ + (a + 8)’. Thus, if we restrict ourselves to quartic terms the Lagrangian 
L = 4 (8,9) — 3m? tr gy” — Ary)’ actually hasan SO (5) symmetry (since g 
has 5 components.) This is an example of what is known as “accidental symmetry.” 
Convince yourself that this holds only to quartic order in g. 


Varying g“? 2, = 3, we have (8g°)g ., = —g%°(8g,,), which upon multiplica- 
tion by g*” becomes 5g%” = —g#? (68,8. You may recognize this as just the 
statement 6M~! = —M~—1(M)M7—! for a matrix M. To evaluate 8g we use the 
important identity det M = eT '°™, which you can prove easily by diagonalizing 
M with a similarity transformation. The left hand side is equal to the product of 
the eigenvalues of M, while the right hand side is equal to the exponential of the 
sum of logarithms of the eigenvalues. {You can define the logarithm of a matrix 
by expanding log[7 + (M — 1)] in a power series in (M — [).} 

Thus, 5 det M = (det M) tr M—'8M and so 5g = gg "dg. We are now ready 
to vary 


s= f dx /=Eh(8"9,0,0 — m’o’) = f dtx/= a6 
Plugging in, we have 
sS= / dx =H 4858-8! (Bi u)8™” 5 8.08, 
Thus, 
2 388 


—— = gh ga,pd,o — gL 


V—8 OB uy 


Te = 
In the flat spacetime limit 
if > 
T™ = (dg) —L= 5 (C09)? + (Vg) + m’9’) 


precisely the energy density as promised. 
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1.10.3. Using the expression for T“” from the preceding exercise, we have 
Pi= J xT" = — f Pxdopay 
and 
(Pi paol=— f @yl&%90), 0100) = 18.00) 


Thus, combined with the fact that P° = H, we have [P“, g(x)] = —id“ (x), 
which just reflects the fact that P“ and x” are conjugate variables. 


1.10.4, Evaluating T,,,, = —F,,F,) — ty,&, we have 


Tyj= 


j= FF} + 15,,(E? — B?) = —E,E; + Fix F jp + 48;(E? — B*) 


Since Fi, F jp = Exemé jin Bm Bn = 5,;B? — B,B;, we obtain the announced result. 


Note that 6; ;7;; = 1(E? + B*) =T, and hence T = 0. 
joiy ~ 2 00 


Part II 
I.1.1. Continuing the hint, we have 


spy yy) =o- <eiplo™, y“y lv = ¢40,,[o"?, y“ly?y 


since y> anticommutes with gamma matrices and hence commutes with the 
product of two gamma matrices. Inserting [o*?, y“] as given in the text we 
have 8(vy4y*y) = of vy’ y>y, which is precisely how a vector transforms. 
Under parity yy 5p > Py °y“y>y°w which equals, Py°y v= —py yw 
for x = 0, and Py°y' yyw = vy' yw for ~ =i. The time component flips 
sign while the spatial components do not. Thus, the behavior under parity is 
opposite to that of a normal vector: yy“y*y is an axial vector. The other cases 
proceed similarly. 


1.1.2. From yy, = al ~ py) and yp = 4+ yy, we find vy, = ¥jy® = wiZ(I- 

yy =v t+ y>) and pp=v3(l—y?). We then just use the Properties of P;, 

and Pp repeatedly. For example, #, Wp = v5. + ae and Yrvy = 051 — 

y°)y, or equivalently, yy = vite + Urv, and py?y = vide — Save: 

As another example, Yuri =W3 Lt yp ys —- yw =vy4*3- yy 

and trv be = =ypy" 3(1 +y>)y. Note that various combinations vanish, for 
example, Wwew, =0, vry“ve =0, and so on. Complete the exercise. 


.3 and 11.1.4, In the appropriate basis the Dirac equation becomes 


SS Pal 
— po, —-E-m x ~ 


I.1.7. 


1.1.11. 


1.2.1. 
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thatis, (E —m)@ + po3x = Oand —po3d — (E + m)x =0. The second equation 
informs us that x = —[p/(E + m)]o3@. For a slow electron x ~ —(p/2m)o3¢, 
so that x is smaller than ¢ by the factor p/2m. The first equation then reduces to 
(E — m — p*/2m)¢ = 0, which just reminds us of the relation between energy 
and momentum in the nonrelativistic limit. 


. In the Wey] basis the Dirac equation for a relativistic electron moving along the 


3-axis E(y° — y3)w = 0 becomes 
0 T-—o?\ (vi \_ 0 
I + a? 0 Wr a 


1 3 
1 27 yl 42 _3,,_{(a 9 
ly. yl=-Sloo71@i=o e1=(4 5) 


Since 


under arotation around the 3-axis, y, > e~@/927° yy, =et/Moy, while vp > 
e-G/oo*y, = e(i/yy,. Indeed, the left and right handed fields rotate in 
opposite directions. 


In either the Dirac or the Wey] basis, (¥,). = y2(v7¥*)* = ¥. 


. It is easiest to work in either the Dirac or the Wey] basis. Let y be left handed, 


that is (1+ y)p =0. Then (1- y)y, = - yp )y?y* = 701+ y)y* = 0 
since y° is real. 


O. eCy pe FOO Ce wr yy = WC since (o”)TC = —Co*?. 


1 3 


Under parity or reflection in a mirror, x! + x! and x? — —x?. Choose y® =o?, 
yoy! =a!, and y°y? =o?. Multiply the Dirac equation (iy “a, — m)y = 0 by 
y® and write [i (89 + y°y/8;) — y°m]p = 0. Then multiplying by o! reverses the 
sign of the 9, term, but also the mass term. I leave it to you to discuss time reversal. 


Apply Noether for the transformation 
yy > ey =(1+id)y 


Then 


bh bh 
—-d5y+ = 
50,0) * 50,0) 


dy = iy" Oy) 


Note that formally © does not depend on aw. Thus, up to overall factors we can 
choose J" = wy“ with the corresponding charge Q = f d°xyy°y into which 
we plug (II.2.10) 


d°p 


— nee se —ipx t ipx 
vay= f Qn) V2(E,/myi? Le. s)u(p, sje?" + d'(p, s)u(p, sjen"| 
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At this point, the calculation pretty much parallels what you did in Exercise 
(1.8.4). The integration f d°x over space produces a delta funciton that sets 
the momentum variables in y and in y equal to one another. The new feature 
here is that we encounter objects such as uy °u. Invoking Lorentz invariance and 
referring to the rest frame form of u and v we haveu(p, s)y“u(p, 5’) = 5,.p"/m, 
u(p, s)y"v(p, s’) = 0, and so on. We obtain 


t t 
o-( 3 Fay LUN P, 007. 5) + dp. NaC. 9) 


> 


As in Exercise (1.8.4) we have to move the creation operator a‘ to the left of the 
annihilation operator d and subtract off an infinite constant. Thus, finally 


o- |? Ejay LOND. Nb. 8) — ap, Nap. 8) 
showing clearly that 5 annihilates a negative charge and d a positive charge. 

To calculate [Q, ¥(0)] = f d?xIv(x)y°v(x), ¥(O)] we use the identity 
[AB, C]= A{B, C} — {A, C}B and the canonical anticommutation relation 
(11.2.4). We find [Q, ¥(0)]= —¥(0), thus showing that b and d? must carry 
the same charge. 


The desired equations are y“W,,, = 0 (this takes out 4 components since o takes 
on 4 values) and (p — m)P We, = 0 (for each p this takes out 2 components and so 
altogether 4 x 2 = 8 components.) Thus, 16 — 4 — 8 = 4 components as desired. 
Another way of saying this is that y“W,,, is a Dirac spinor and hence the spin 5 
part of the vector-spinor WV, , 


Referring to (C.11) we see that in do the factor 


1 1 1 


ToS oe: ——__ (2n* 
fi, — WIE(P)E(P2) QrVEKk) — (27 )PECKn) 


reduces to 55 (m/E)*[1/ (2n)"] since S = 3. Integrating the factor d3 P,d° P,5 
(Py + po - P, — P,) over P, we knock off 3 of the delta functions, leaving 
us with dQdP,P?5(2E — E,), and so the integral over P, gives dQ} E?. Fi- 
nally, the factor containing the “real physics” is + )> > [M|? = (e4/4m*) f@). 
Multiplying the three factors together and dividing by aQ, we obtain da /dQ = 
(4)°[e* /(27)710/E7) f @), as given in the text. Note that m cancels out as ex- 
pected. We should be able to take the limit m — 0 compared to the energies 
without the cross section either blowing up or vanishing. 


Part II 


The amplitude should become nonanalytic when both denominators of the in- 
tegrand [1/(k* — m? + ie)]{1/[(K — k)? — m? + ie]} blows up, namely when 
k* =m? and (K — k)? = m7. But we found the condition in Exercise (1.7.5), 


11.1.3. 


T.2.1. 


11.3.4. 


11.5.2. 


11.5.3. 
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namely that K* > 4m”. Referring to (I0.1.15) 


ws if 2 
M = ix i da log || 
3277 Jo a(1—a@)K2 —m? + ie 


we see that the log has a cut starting at K? = m?/a(1 — «). As a ranges from 0 
to 1, the minimum value of m*/a(1— a) is attained at a = 5. So indeed, the cut 
starts at K* = 4m’. 


Under the indicated change, log A > log e®A = log A +6, and so 6M = 
~idd + iCA23(2e) + O(d9). Thus, 5M = 0 implies 54 = 6C A722 + O(A) = 
6CA75 log A + O(A>) giving the stated result for A(di./d A). 


For f d¢x(ag)* to be dimensionless, we need [p] = (d — 2)/2. Thus [y”] = 
n(d — 2)/2 and so in order for f d4x),,p" to be dimensionless, we must have 
[An] = n(2 — d)/2+d. 


. When we set m = 0, the integrand is manifestly a linear combination of y 


matrices. The integral cannot produce a term independent of the y matrices, which 
is what B is. For electrodynamics, the integral is changed to 


9, d*k 1 kyky p+ K+m 
232 all _ weve i v 


= A(p’) p+ B(p’) 


When m = 0, the integrand is a linear combination of the product of three y 
matrices, which can only reduce to one y matrix, not to none. Incidentally, 
an alternative way of seeing the results stated here is to recall from Chapter 
11.1 that with m = 0 the Lagrangian is invariant under the chiral transformation 
ye! Oy 


This essentially follows from D = 4 — By — 3 Fr for By = 0 and F; = 2. Then 
D=1 but the linear divergence is reduced to logarithmic divergence by the 
symmetry argument given in the text. 


Basically, you have already done this problem in Exercises [1.1.3 and II.1.4. You 
merely have to replace E and p by 0/dt and V (see also chapter III.6). 


In nonrelativistic quantum mechanics, the scattering amplitude is given in 
the Born approximation by i times the Fourier transform of the potential: 
i f d°xe'**U(%). The scattering amplitude owing to the exchange of a scalar 
meson of mass m is just i/(k? — m2) ~ —i/(k* + m?). Thus, we just repeat the 
calculation in Chapter I.4, obtaining 

ak eikk 1 


eo 
(27)3 k? +. m2 Amr 


—mr 


UGx)=- 
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u(p’)( py" + vy" pyu(p) = 2mu(p’)y*u(p) by the equation of motion, but us- 
ing y4y” = Hy, vy} + Ay4, y")=n” — io” we can also write (py + 
y! py =(p' + p)* +io*"(p’ — p),. We thus obtain the Gordon decomposition. 


0.6.2. We compute 


ny 

quit (p'y" F\(q?) + ar Fy(q?)lu(p) = a(p’) du(p) F\(q’) 
=a(p’)(p' — pup) F\(q’) 
= ai(p')(m — m)u(p)F,(q”) = 0 


where the first and third equality follows from the antisymmetry of o and the 
equation of motion, respectively. 


. Proceeding as in the text but living in d—dimensional spacetime, we obtain 


N 
iM, (q) =—i f oe where 5 = =f) dat with D = (2 + a(1 — aq? — 
m* + ig)” as before but with N,,, now effectively equal to —d((1 — 3) Bw + 
a(1—o@)(2¢,4y — uv?) + mB) Rotating to Euclidean space we see that 
we have 0 do the integrals (with c? =m’ — a(1— a)q”) S aia oe eye and 
f se os = f et, Te ~ cf eel las. I did the first of these in- 
tegrals for you in appendix II in choate I. 1. Generalizing slightly we have 
_ _ 1 —¢ 

fo. ali4 pe = pct 74 f dx(1 — x)7~1x2-!-§ | 1 will let you carry on 
from here. 


Part IV 


. Write @ = (9), &, ..., v + ¢)y). We compute 4u?G¢ — (A/4)(¢7) up to O(¢) 


and find (upon dropping the ’) 
> A > 
su (wv + 2vgy + ¢) — 7 (vt + 4u°gs, + 4v°gy + 20°") 


The condition of no linear term in gy fixes v? = 2”/A and so the coefficient of 
¢@” is equal to 44.7 — 4/4(2v?) =0. The (N — 1) fields 91, g2, ...,¢n_) are 
massless. 


. Wehave {5° dk log((k? +a°/kh= = ma and so Vig(¢) = Vig) + f/ Vp) /2 + 


O(h?). For L = 5(8g)" — "gy, the quantum oscillator with g identified as 
position, we have Vag (0) = Vo 


IV.3.3. We have 


: 2 2 
m(p) = fein Vp) =2i f £2 log =H 


Iv.3.4. 


IV.4.1. 


Iv.5.1. 


IV.5.2. 


IV.5.4. 


IV.5.6. 
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which after Wick rotation becomes 


2 2 +m 2 oo 2 
d Pe log PE i (y) --+ | dx log 21" 
(21) Pi: 2m Jo x 
After cutting off the integral at A? and adding a counterterm By" we obtain 


Ve = H(fo)?log & 

2x M2 
Vere =i 2, f atk /(2x)41/2n) LV") / 71". For V"(g) = 39? the corre- 
sponding Feynman diagrams consist of a circle with n V’s attached to the cir- 
cumference, where the 2n is the infamous symmetry factor that I tried to avoid 
talking about in Chapter 1.7. 


With A an arbitrary p-form, 


a= (p+D! pie Bete. sig dX ax dxdt? ...dx"p 


—[9,, Oy Ay yyy...upAxdx'dxtdx'? ...dx#e =0 


If you have done all the exercises thus far (see Exercise 1.9.3), you have al- 
ready made the acquaintance of the J = 2 scalar field transforming as g?° > 
R% R°4 gcd — Ryd (RT)4> — (RepR*)”, which thus can be written as a trace- 
less 3 by 3 symmetric matrix g > RgR*. Now you merely have to write out 
the covariant derivative D,,g (see IV.5.20) explicitly, [Hint: The action of the 
generators on ¢ is similar to what is shown in (B.20).] 


dF =d(dA+ A”) =dAA — AdA and [A, F]= AdA — dAA and so dF + 
[A, F]=0. Explicitly with indices, this reads e“""" (8, F,, + [A,, F,,]) =0.In 
the abelian case, we have, for u = 0, e/* 3, F, = V - B =0 (recall Chapter IV.4!), 
and for pp =i, eJ*(—ag Fj, + 8; Foy — 8) Fy) = —4pB; + (V x E); =0. 


From the general arguments mentioned in the problem we know that tr F? must 
be the “d of something.” Now d tr AdA = tr dAdA and d tr 3A? = 3 tr(dAA? — 
AdAA + A7dA) = 2 tr dAA* but on the other hand tr F* = tr(dA+ A*)(dA + 
A’) = tr(dAdA + 2dAA’) since t At =tr APA =— tr AA? =—-— tr A*=0. In 
electromagnetism, tr A? = 0, and d tr Ad A when written out in elementary nota- 
tion is just 0, (e#*" Ad, A) = fete F Fy 


We simply plug in the general expression in the text and obtain 


1 —,. 
L= apter™ + q(iy"D, _ m)q 
with the covariant derivative D, =9,,—iA, =, —i Al T*, where T? (a = 
1,..., 8) are traceless hermitean 3 by 3 matrices. Explicitly, (A,,q)* = 
Aa (T*)5q?, with a, 8 = 1, 2, 3 (see Chapter VII.3). 
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IV.6.3. Observe 

3 1~i2 
Afri= AL, Ai, 
i“ Alte —A? 
us ue 

with the obvious notation A[E? = Al + iA. Let (yg) = (°) so that 

a 1 i gal? 
Do =3,9 —i(gA* 5 + g'B,~ + -io( H 
2? = Iu — (8A, “5 + BBs )e 2” \ ga? +9'B, 


Thus, |D,,g|? contains v?[g?A +741? + (—gA® + g'B,,)”]. The combinations 
Ane, Al”, and (-gA3 + g’B,,) acquire mass while (g’A® + ¢B,,) remain 
massless. ° 


IV.7.4. We have 


dip N#Y 


(27)4 D + {u, ky oY, ka} 


A#’(k,, ky) =i 


where 
N#Y =tr(y°(p ~ f+M)y"(b- he t+ M)y*(p + M) 


Only the term linear in M in N“” does not vanish, giving N¥” = 41 Me#"°" ky ko,. 
Since we are interested only in terms of O(k,k,) we can set D -» (p” — M7)? so 
that 


d‘p 1 i 


SF __- __ = eth k 
(Q1)4 (p?— M2)3 42M lows 


AM (ky, ky) = 8M ek kor 


with a dependence on M as stated in the problem. The effect of the regulator, like 
some unsavory acquaintance, remains even after we have sent him to infinity. 


IV.7.5. We will sketch the solution. The details may be found in the lectures given 
by S. Adler at the 1970 Brandeis Summer School. The point is to imagine a 
regularization scheme that preserves the various relevant symmetries, namely 
Lorentz invariance, vector current conservation, and Bose statistics. As you will 
see, we don’t actually have to specify the regularization. By Lorentz invariance, 
we have 


AM (ky, ky) = ek Ay + aaaad oF.) + eM Kok) As 
+ eT ka ky Ag + 0 ki ghar kt’ As 
A 
+E "Thi gkacks AG + BHT Kk kockt Ay 
+ ehh kopks Ag 


Since the Feynman integral representing A*“” is superficially linearly divergent, 
we see that A3,..., Ag are all convergent since we have to pull out three powers of 


V.1.1. 


V5.1. 


V.6.1. 
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momentum to extract them. In contrast, A, and A> are logarithmically divergent. 
But we can relate them to A3, ..., Ag by vector current conservation since 
0= ky, A*¥” = EPOTR ky (—Ay + ke As + ky, - ky Ag) and thus A> = kt As +k, - 
k, Ag. Similarly for A,. Rationalizing the Feynman integrand and evaluating the 
trace in the numerator, we can systematically ignore terms that contribute only to 
A, and A>. Furthermore, Bose statistics gives us relations such as A3(k}, ki, q?) = 
—Ag(k3, ki, q”). 


Part V 


We dropped the term 47496 but kept the term 4g*6h?. This requires 899 < g°/, 
that is, w < g*A, but since in our solution w ~ g./p/mk this requires k < 
g./mp, which is consistent with what we assumed about k. Looking at the terms 
—2./phaod — 4g? ph? in L we see that h ~ 0/(g2./6) < /p, which is also 
consistent. 


With y° = 93, $I + y>) clearly projects out the top and bottom component of 
v= ( ye ), respectively. Everything is formally the same as in Chapter II.1, but 
we can also work things out explicitly in the specific representation given here. 
Thus, Py = Woy =i. — We) and by by = Wlopash =i(WRde + 
Wie). Under the transformation yy > ely, Wy > ef? w, and Wr > e7 wr, 
and the massless Dirac Lagrangian 


a a a a 
L=ing(— + vp —)e tiv} (— — op — 
YR Up wR ada Up VE 
clearly does not change. 


This of course just follows from Lorentz invariance. We have 


and 


and thus the equation 


x — vt 
(8? — 87) 9( 


becomes 
x—vt x—vt 


V1—v2 V1—v2 


Note that this does not depend on the form of V. For any relativistic theory, the 
soliton moves like a relativistic particle (obviously!). 


e"( 1+ V'Tp( 


)]=0 
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The sine-Gordon theory has an infinite number of vacua occurring at 9 = 
(2n + 1)x/f. Thus, there exists a whole spectrum of solitons, such that (+00) = 
(2n4. + 1)a/B. The topological current is J* = (8/22 )e"" 4,0 with the corre- 
sponding charge Q = (n, —n_). The Q =2 soliton decays into two Q =1 
solitons. 


Gi/2m) for gdg! = G/2x) for ede”? = (i/2m) f(—ivdd) = (v/2m) 
fe * d@ =v, which indeed counts the number of times e”? winds around the 
circle. What mathematicians call the winding number is indeed just the magnetic 
flux of the physicist. 


Within a region small enough so that we can treat g* = vd? as constant, using 
(D,,9)? = 4,9? + es AS of we have (D,,)' = evar and (D,,9)* = —evAl 
and thus 
_ Fin?” /ede*”9*(D,9)(D, 9) 
oI ll 


3 2,41 241, _ 3 3 
> Fit e(A A, - AA) = 0,4) — dA; 


precisely the electromagnetic field strength since At is the massless component 
of the Yang-Mills field. Let us compute By = €;;,5; j far from the magnetic 
monopole. To calculate the magnetic charge we are interested only in the term of 
order 1/r? in B. Since Dio > O(1/ r*) by construction we can drop the second 
term in F;;. Thus, we merely have to compute Fy = 9; A“ — 9jAp + eth AP AS. 
Since Fi, will eventually be contracted with the unit vector g*/|g| = x°/r, we 
can effectively drop some of the terms in Fi, thus simplifying the computation. 
We have 


1 oj, 2! I. 1 
. a _ (o aji*_ 6 afi 
0, A;; dre “2? 5f 2 
and 
be pbim pcjnym yn 
abc b,c _. Cfeyet?e?lMednx™ x 
_ Afey(6%5" = 8B ex” OL sin arn 
r4 er‘ 
so that 
Fegt Fax? . , 
nije ee J 24 Det x? = _ 1 aij ,a 
ly| r er? er? 


and hence B, = —(1/er?)x*. The magnetic charge g = —42r/e. 

Our result appears to differ from Dirac’s quantization condition (TV.4.10) by 
a factor of 2. The resolution of this apparent paradox is instructive. In fact, we 
can always introduce into this theory a filed YW (which could be a Bose or a Fermi 
field) transforming in the J = 3 representation with the corresponding covariant 
derivative D,,V = 0, — ie(Zt*)AGw, The field W carries electric charge se. 


V.7.7. 


V.7.8. 


V.7.9. 


V.7.11. 


V.7.12. 


V.7.13. 
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Thus, the fundamental unitof electric charge is actually 5, not e, and our result 
g = —4n /e = —20 /(e/2) is actually nothing but the Dirac quanization condition. 
(The sign is trivial: just a question of which one we call the monopole and which 
the antimonopole.) 


Plugging in the Ansatz gy? = (H(r)/er)(x*/r) and A? =[1— K (rye! (x4 /er?) 
[so that H (r) ane and K (r) = 0 in accordance with the asymptotic behav- 
ior (V.7.5) and (V.7.6)] into M = f d3x{1(F,;)" + 4(D,@)? + V@)} we get M as 
a functional of H and K. Minimizing M gives (with H’ = dH /dr etc) the equa- 
tions r?H” = 2H K? + (A/e*)[H? — (ev)*r?HJandr?K" = K(K? —1)+ KH’. 
For help, see M. K. Prasad and C. M. Sommerfeld, Phys. Rev. Lett. 35: 760, 1975. 


The BPS solution corresponds to setting 4 = 0 in the two equations in Exercise 
V.7.7, rendering the equations soluble, with the solution H (r) = evr(coth evr) — 
land K(r) = evr/(sinh evr). Ask yourself why H(r) and K (r) approach their 
asymptotic values exponentially. What determines the length scale? 


For help, see B. Julia and A. Zee, Phys. Rev. D11: 2227, 1975. 


We derived the lower bound for the mass of the magnetic monopole 42 v|g| ~ 
4n(ev)/e* ~ My /a. 


Near the identity element g = elf Fw 1+i6-6 and thus gdg' ~ —id6 -G. 
In a small neighborhood of the identity element the group manifold is locally 
Euclidean and so 


tr(gdg')’ =i tr(o'a/o*)do'deido* = —12d0'd6"d0? 


is manifestly proportional to the volume element on S3. For g = e! 4191 +8202+™ 9303), 


ir(gdg')? = —12md6'de7de3 


Sf atx8,JL) = f Px larco — f Ax Ri oo- Recalling that J? = ¥Rvp — 
Vv ; Wj , we see that the two spatial integrals just count the number of right moving 
fermion quanta minus the number of left moving fermion quanta at t = -+oo 
respectively. So f d4x (8,05 ) is an integer. On the other hand, in the text we 
proved that f tr F 2 is a topological invariant. In other words, with suitable 
normalization, evidently 1/(47)*,the integral [1/(42)7] f d*xe#™? tr Fi Fig 
is an integer. Thus, the coefficient 1/(4sr)* cannot be shifted even a little bit by 
quantum fluctuations. 


Part VI 


The quartic interaction term (1/2f7)(z -82)* in & gives the amplitude 
i (1/2 f7)i780?5°4 (kjky + kykg + kok; + kok,) + permutations = (i /2 f7)82°5°4 
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(ky + kz)? + permutations for the 4-pion interaction vertex (where for conve- 
nience we have labeled all the momenta as going outward so that kj + ky +k34+ 
k4=0). 


After writing 0 = v + 0’, we find, as in Chapter IV.1, that £ = —5(2p2)0” ~ 
hvo’nt? — iMG)? +--+, where we have displayed only terms relevant for our 
purposes. The diagrams contributing to four-pion interaction are of two types, 
those involving the A(7*)? term and those invoking a’ exchange. The former 
gives for the amplitude (—{A)2 . 2(595°4 4 §9¢8>4 4 5945c) while the latter 
gives (—iAv)*{2i/[(ky + ky)? — m2,]}6%78°¢. Thus, expanding to first order in 
momenta squared we find the coefficient of §9°3°¢: 


2 
— id — 2a | a4 Sth) 


m-, m2 


a a’ 


+4 2,,2 2 


IX 2 
= ——(k, +k)” 
2p? 2p? | 22 1+) 
To compare with Exercise VI.4.2 we remember that f* = v? = y7/A so that the 
amplitude here is also equal to (i /2 f7)87°54(k, + k,)? + permutations, as we 
had anticipated in the text. 


We will track down factors of 2 carefully but not factors of i and —1. Let us go 
back to the chiral transformations y > [1+ i6 - (2/2) y >| andy > [1+ i6- 
(%/2)y>]. Thus, (Ww) = O* Wiy>t4w and S(Wiy?t*W) = —0% pW. Hence, for 
L=wf{iyd+e(o +it -Hys)}W + L(c, 7) to be invariant we must have do = 
6°x* and 6° = —6%o. Applying Noether’s theorem J,, = (6£/63,p)dy, we 
obtain the current Ji, = wiy,ys(t7/2)W + 2°0,0 — 04,7 written in the text. 
Comparing the term piy,,/5n contained in Jee = J}, +iJ7, with the current 
Js,, defined in Chapter IV.2, we see that Js,, = —is it? The normalized state 
|a~) = (1//2)([a!) — i far)) sothat (O|x'+? |[z-) = 2//2. The current J)4?? 
contains the term —v8,alt ? and thus f = /2u. Next, we have to work out the 
pion-nucleon coupling g,, yy as defined in Chapter IV.2. Here £ contains gwit - 
ityswW, which contains /2g piysna~ since 1!~!? = ./22~. Thus, 8nnn = Vv28. 
Putting it together, we see that M = gv translates to 2M = fg, ny in agreement 
with Chapter IV.2. 


See Figure VL.6.1. From Ah = (d/cos 0) ~ d(1+ 567) and (3h/dx) = tan ~ 0, 
we have (8h/8t) « 6? x (8h/8x)", thus giving rise to the term (4/2)(WA)?. It 
all goes back to Mr. Pythagoras. 


We integrate the term 4 f dx dt(((8/8t) — V2)AP? in S(h) by parts to obtain 
—4 f d?% dt[h((a/at) + V?)((8/8t) — V2)h]. Thus, the propagator is the in- 
verse of the operator (8/dt + V2)(8/3t —-W)= 92/at2 — (V2)?, the Fourier 
transform of which is —(w* + k‘). 
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With h’(Z, t) =h (% + git, t)+H-%4+ § ut, we have dh'/dt = dh /dt + gi - 
Vh + (g/2)u? and Vi! = VA + id. Thus the combination (84/t) — £(Vh)? is 
invariant, as is (obviously) V2hA. In other words, S(A) must be constructed out of 
these two invariant combinations. 


Look at the action S(h) = 5 f d?X dt [(@h/at) — V7h — g(Vh)*/27. Compar- 
ing 0h/dt — VA we see that time has the dimension of length squared: T ~ L?. 
From the term f dx dt (ah/a t)? and the fact that S is dimensionless, we have 
[AP ~ T2/(L°T) ~ 1/L?~? and so h has the dimension of (1/L2~2)2 . Compar- 
ing V2A with g(Vh)2 we see that g has the dimension of 1/h, that is, LP-2)/2, 


We are told that L(dg/dL) = (2 — D)g/2 + (2D — 3) fpg? +-+-. We are as- 
suming that the terms (- - -) can be neglected. Thus (in what follows a” and b? are 
two generic positive numbers) for D = 1, L(dg/dL) = a’g — b*g? and g flows 
toward the fixed point g* = a/b. (Incidentally, the KPZ equation is soluble for 
D =1 by methods not explained in this text and both z and x are Known ex- 
actly.) For D = 2, L(dg/dL) = b’g° and g flows toward some unknown strong 
(presumably) coupling fixed point. For D = 3, L(dg/dL) = —a*g + bg>. The 
fixed point g* = a/b is unstable. For g < g*, g flows toward the trivial (i.e., free, 
or Gaussian) fixed point. Since the theory at the fixed point is free we know the 
critical exponents exactly: z = 2 and x = (2 — D)/2. For g > g*, g flows toward 
some unknown strong (presumably) coupling fixed point. 


Part VII 
Using the BHC formula given, we have (the V’s are clearly irrelevant) 


U, Un = pidAy pidAy _ pitlAyt+Ay)—4a7[ Ay, Ay Har 404 D+ 0(a°) 
ij 


Similarly, 


_iaal - at _sarat ry_ 1,2 3 4 5 
Ui; —e iaA\, ,—iaA, —e ia(Ai+A)) 7@°[A, AyHavE+a°F +0 ), 


the prime reminding us that the A,, and A,, in this expression is to be evaluated 
on the “north” and “west” side of the plaquette in Figure VII.1.2, respectively, in 
contrast to the A,, and A, in U;;U ;, which are evaluated on the “south” and “east” 
side, respectively. Here C, D, E, and F denote various commutators, which we 
drag along merely to show that they eventually drop out in what interests us. (Note 
how the different terms are associated with different powers of a, as indicated. 
Note also that in some places we have dropped the prime on A and absorbed the 
“error” in doing so into terms higher order in a.) Thus, 


Uy = po talAyt+Ay)—ia? Ay Oy Ay— FilAy AyD +a +a4H +0) 
j= 


where G and H denote sums of commutators and terms such as 0,0,A,, and 
d,0,0,A,,- Applying the BHC formula again to the order indicated we have 
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UU Ui = ei BpAy— Ay)" [Ay A HOGS _ pia? Fy, +a +047 +0(@) 
with F,,, =9,A, — 0,A,, +i[A,,, Ay]. The same remarks on G and H apply to 
Z and J. The Yang-Mills field strength emerges naturally, as we would anticipate. 
Since the traces of commutators and of A vanish, when we apply the trace all the 
junk drops out to O(a>) and we have 


S(P) =Ret[i—- 4atF,,F,,+0@)] 


By gauge invariance, the corrections must be of even order in a but for our 
purposes we don’t care about them anyway. Evidently, f and g are related by 
some uninteresting factors of a. 


Part VIII 
R”? = dw'? = d(— cos @dg) = sin OdOdo = 2Rj,dodg = Rye = 5 sin @. 
Since ef = 1, eS = 1/sin@, we obtain R= Re elles = 2Rye1 ey = 1, inde- 
pendent of @ and 9 as expected. 
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classical limit, path integral formalism for 
taking, 19 


classical physics, symmetry of, 243 

Clifford algebra: and Dirac bilinears, 93; and 
Dirac equation, 90-91 

closed forms, 219 

coherence length, 271 

Coleman, Sidney, 224, 455 

Coleman-Mermin- Wagner theorem, 200 

Coleman-Weinberg effective potential, 211 

color, quark, 368, 369 

condensation, and superconductivity, 270 

condensed matter physics: critical dimension 
in, 345, 349; disordered systems 
studied by, 330, 334; goal of, 306; 
Goldstone’s theorem in, 199—200; 
impurities studied by, 334; length scales 
in, 154; momentum density in, 173; 
number conjugate to phase angle in, 
174; particle physics and, 255, 437- 
38; quantum field theory and, 5, 172, 
255; quantum Hall effect and, 331-32; 
quasiparticles in, 304; renormalization 
group in, 341-44; spin-statistics rule 
and, 117 

conductivity, vs. conductance, 347-48 

connected graphs, vs. disconnected graphs, 
27-29, 46 

conserved current: charge associated 
with, 74; and chiral symmetry, 96; 
and continuous symmetry, 73-74; 
momentum space version of, 131 

continuous symmetry, 71-72, 197; conserved 
current and, 73—74 

continuous symmetry breaking, 196; 
Coleman-Mermin-Wagner theorem on, 
200; and massless fields, 198-99 

Cooper pairs of electrons, 270 

coordinate transformations, 76-77 

cosmic coincidence problem, 436 

cosmological constant, 434—35; measured 
in units of Gev*, 435-36; order of 
magnitude of, 435 

cosmological constant problem, 435; 
approach to, 440; roots of, 434; string 
theory’s inability to resolve, 436; 
supersymmetry as solution to, 443 

Coulomb potential, modification of, 190 

Coulomb’s electric force: and Newton’s 
gravitational force, comparison of, 27; 
quantum field theory on, 30-31 

counterterms: cutoff dependence absorbed 
by, 212; in Feynman diagrams, 160-61, 
161f; nonrenormalizable theories and, 
164, 212 

coupling, electromagnetic, 339 

coupling constant(s), 148, 158; 
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dimensionless, 155; of electromagnetic 
interaction, 340; hadron proliferation 
and, 202; as misnomer, 340; pion- 
nucleon, 206; renormalized, 150; 
Yang-Mills, 231-32 

coupling renormalization, 158-59 

CPT theorem, 101 

critical dimension, in condensed matter 
physics, 345, 349 

critical phenomena, 267; complete theory 
of, 269; Landau-Ginzburg theory of, 
268-69 

current conservation. See conserved current 

curved spacetime: Dirac action in, 430; 
introduction to, 79-82; quantum field 
theory in, 77, 264 

cutoff dependence, 148; avoiding in physical 
perturbation theory, 161; counterterms 
absorbing, 212; disappearance of, 
150, 151; of meson-meson scattering 
amplitude, 158 


Dashen, Roger, 389 

Deser, Stanley, 453 

Di Vecchia, Paolo, 453 

differential forms, 218—19- use in nonabelian 
gauge theory, 228, 229 

differential operator, propagator as inverse 
of, 22 

dilation invariance, 79n 

dimensional analysis, 154—55, 438; on 
meson-meson scattering amplitude, 158 

dimensional regularization, 151, 152-53, 
187 

Dirac, Paul, 101; on electric charge, 
quantizing of, 394; on magnetic 
monopole, 285; metaphorical language 
of, 109; on path integral formalism, 
10-13; on positron, 5; on quantum 
mechanics and magnetic monopoles, 
217; on spinor representation, 114, 
teaching style of, 439 

Dirac basis, vs. Weyl basis, 95-96 

Dirac bilinears, 93 

Dirac equation, 89-102; Clifford algebra 
and, 90-91; in curved spacetime, 430; 
and degrees of freedom, reduction in, 
91; derivation of, 115; electromagnetic 
field and, 97; handedness and, 96; 
Lorentz transformation and, 92-93; 
magnetic moment of electron in, 177- 
78; origins of, 89-90; parity and, 94; in 
solid state physics, 273, 274; solving, 
95: time reversal and, 100-101 

Dirac field: interacting with scalar field, 


Feynman rules for, 53-54, 471- 
72; interacting with vector field, 97; 
interacting with vector field, Feynman 
rules for, 128, 128f, 472-73; propagator 
for, 126; quantizing, 103-10, 120; 
quantizing by Grassmann path integral, 
126; vacuum energy of, 107~8, 124 

Dirac operator, 109 

Dirac spinor, 90, 92; components of, 114; 
and supersymmetry, 111 

disorder: Anderson localization of, 330— 
31, 334; condensed matter physics 
and study of, 330, 334; Grassmannian 
approach to, 334 

dispersion relations, 207 

divergence(s): degree of, 161-63; with 
fermions, 163-64; logarithmic, 160, 
161-62; in quantum field theory, 56-57, 
145-46; superficial degree of, 161, 165; 
dependence on dimension of spacetime, 
165; total, under supersymmetric 
transformation, 447-48 

dotted and undotted notation, 113-14, 
479-82; replacing, 457 

double-line formalism, 378, 379 

double-slit experiment, 7, 8f; expansion of, 
7-9, 8f, Of 

double-well potential, 194, 194f 

Drell, Sid, 337 

dual theory, vortices as charges in, 310-12 

duality: in (2 + 1)-dimensional spacetime, 
313; concept of, 309, 310; 
electromagnetic, 222; and linking 
of perturbative weak coupling to strong 
coupling, 455; of monopoles, 312-13; 
nonrelativistic treatment of, 314—16; 
relativistic treatment of, 313~—14; of 
string theories, 313; vortex, 312 

dynamical symmetry breaking, 201; example 
of, 371 

dynamical variable, in quantum field theory, 
18 

dyon, 285 

Dyson, Freeman, 59 

Dyson gas approach, 384—86 


effective field theory: development of, 437; 
Fermi theory of the weak interaction as, 
441: of Hall fluid, 302-3, 437-38; of 
neutrino masses, 441; predictive power 
of, 441-42; of proton decay, 440-41, 
and renormalization group flow, 438 

effective potential, 209-10; Coleman- 
Weinberg, 211; generated by quantum 
fluctuations, 215 
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Ehrenfest, P., 427 

Einstein, Albert: and cosmological constant, 
435; and repeated indices summation 
convention, 457n 

Einstein Lagrangian, linearized, 32 

Ejinstein’s theory of gravity, 76, 78; 
and deflection of light, 425-26; 
nonrenormalizability of, 157; 
Yang-Mills theory compared with, 430 

Einstein-Hilbert action for gravity, 419-20; 
Newtonian gravity derived from, 424— 
25; Yang-Mills action compared with, 
420-21 

electric charge: quantized, grand unification 
on, 394; quantum fluctuations and, 188, 
189; renormalization of, 189, 190 

electric force: and gravitational force, 
comparison of, 27; quantum field 
theory on, 30~31 

electromagnetic coupling, flow of, 339 

electromagnetic duality, 222 

electromagnetic field: Dirac equation in 
presence of, 97; and energy of vacuum, 
65; as quantum field, 3—4; stress-energy 
tensor of, 78-79 

electromagnetic force: knowledge of, 434, 
between like charges, 31 

electromagnetic wave, degrees of freedom 
of, 35 

electromagnetism: Faddeev-Popov method 
applied to, 170-71; and gravity, 
unification of, 428; Maxwell on. See 
Maxwell theory of electromagnetism; 
weakness of, 398-99 

electron(s): absolute identity of, 117-18; 
binary strings in, 412; in condensed 
matter system, 255; Cooper pairs of, 
270; degrees of freedom of, 91, 95; 
effect of magnetic field on, 223-24; 
energy levels available to, 5; Fermi- 
Dirac statistics for, 117; fractional 
Hall state of, 301; magnetic moment 
of. See magnetic moment of electron; 
mass of, in classical physics, 166; 
noninteracting hopping, 273-74, 273f, 
274f; pairing into bosons, 270; pairs 
of, Bose-Einstein statistics for, 117; 
photon fluctuation into, 183-85, 184f; 
requirements of antisymmetric wave 
function, 103; stability of, 397 

electron scattering, 130-34, 131f; off 
nucleons, deep inelastic, 368-69; 
off protons, deep inelastic, 341; off 
protons, Schrédinger equation for, 3; to 
order e*, 136-40, 137f-138f, 139f 


electron-positron annihilation, 372-75 

electroweak theory, 155-56, 361; 
construction of, 361-65; 
renormalizability of, 366 

energy: fundamental definition of, 78; of 
mass, 34, quantum mechanics and 
special relativity on, 3; of vacuum. See 
vacuum energy 

energy density, 34 

energy scales: in particle physics, 154; 
renormalization group and, 342 

energy-momentum tensor, 297 

entropy, Boltzmann on, 287 

Euclidean path integral, 12 

Euclidean quantum field theory, 261-62; and 
high-temperature quantum statistical 
mechanics, 264; and quantum statistical 
mechanics, 263 

Euler-Lagrange equation, 13, 74, 424, 434 

exact forms, 219 


Faddeev-Popov method, 169-70, 240, 353; 
applying to electromagnetism, 170-71; 
and derivation of graviton propagator, 
423 

Fermi, Enrico, 101 

Fermi coupling, 155 

Fermi field, mass correction to, divergence 
of, 165 

Fermi liquid, gapless modes in, 259 

Fermi theory of the weak interaction, 

203; as effective field theory, 441; 
within electroweak theory, 156; 
intermediate vector boson and, 156—57; 
nonrenormalizability of, 155, 164, 366; 
predictive power of, 438 

Fermi-Dirac statistics, 117 

fermion(s): and bosons, unification of, 443; 
degree of divergence with, 163-64, 
Feynman rules for, 127-28, 127f; 
in lattice gauge theory, 358; mass 
correction for, divergence of, 165, 166; 
massive Dirac, and Chern-Simons term, 
297-98 

fermion masses: in grand unification, 401-2; 
naturally small, 404 

fermion propagator, 109 

fermion-fermion scattering, Feynman 
diagram for, 156, 157f 

ferromagnet(s), 199; effective low energy 
description of, 322-23; low energy 
modes in, 323-24; magnetic moments 
in, 322; order in, 306 

ferromagnetic transition, 270 

Feynman, Richard: on Dirac, 101; main 
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contribution of, 41; metaphorical 
language of, 109; on path integral 
formalism, 7-10; Yang-Mills theory 
and, 353 

Feynman diagrams: analyticity properties of, 
189; beginning of, 27, 28f; breaking 
shackles of, 287; canonical formalism 
and, 42; childish game generating, 
52, 52f; connected vs. disconnected, 
27-29, 46; counterterms in, 160-61, 
161f; discovering, 42-49, 43f, 44f, 
45f, dominance of, 277; for electron 
scattering, 130, 131f; evaluating, 475— 
77; for fermion-fermion scattering, 156, 
157f; finite temperature, 263; function 
of, 49; limitations of, 67; loop, 44, 
56-57, 56f, 58f; in momentum space, 
53; orientation of, 53; path integral 
formalism and, 42; in perturbation 
theory, 54, 55f; regularization of, 
alternative ways of, 151; relating 
infinite sets of, 206-7; in spacetime, 53, 
57 

Feynman gauge, 141 

Feynman rules, 471-74; discovery of, 59; for 
fermions, 127-28, 127f; in nonabelian 
gauge theory, 473-74; in physical 
perturbation theory, 160-61, 161f; for 
quantum electrodynamics, derivation 
of, 135-41; in random matrix theory, 
381, 381f; for scalar field, 53-54, 471- 
72, for spontaneously broken gauge 
theories, 239-40; for vector field, 128, 
128f, 472-73; in Yang-Mills theory, 
230, 231f 

field renormalization, 160 

field strength, construction of, 228-30 

Fierz, M., 118 

Fierz identities, 442 

Fisher, Matthew P. A., 269, 314n; and 
renormalization groups, 342 

fixed point(s), strong coupling, 340 

flux: fundamental unit of, 302; gauge 
potential and, 312 

force: origin of, 27; particle and, 25-27. See 
also specific force 

forms: closed vs. exact, 219-20; geometric 
character of, 222-23, 223f 

fractional Hall effect, 301-2 

fractional (anyon) statistics, 293; coupling to 
gauge potential, 294-95; gauge boson 
and, 299; misleading nature of term, 
295; and quasiparticles, 305, 306 

freedom, degrees of, 35; canonical formalism 
and, 67; Dirac equation and reduction 


in, 91; of electron, 91, 95; gauge 
invariance as redundancy in, 241; 
longitudinal, in massive gauge field, 
237 
free-field theory (Gaussian theory), 21-22, 
41: in terms of Fourier transform, 24 
Fujikawa, Kazuo, 252 


gamma matrices, 90, 114, 475; products of, 
91-92; trace products of, evaluating, 
132-33 

Gamow, George, 117n 

“Gang of Four,” 347 

gapless mode, 258; Bogoliubov calculation 
of, 258, 260; linearly dispersing, 
258-59 

gauge boson(s): and fractional statistics, 299; 
and intermediate vector boson, 285; 
mass spectrum of, 238-39 

gauge fixing, 168 

gauge invariance, 79n, 135, 457; of 
Chern-Simons term, 306; and Dirac 
quantization of magnetic charge, 220; 
discovery of, 135n; in lattice gauge 
theory, 358; in nonabelian gauge 
theory, preserving, 187; origin of, 
168; proof of, 136-41, 186-87; as 
redundancy in degrees of freedom, 
241; regularization respecting, 
185-87; and renormalizability, 
395 

gauge potential, 223; flux associated with, 
312; fractional statistics and, 294-95; 
in Hall fluid, 303-4, 307; nonabelian, 
227, 228 

gauge theory(ies): Faddeev-Popov 
quantization of, 169~70, 240, and fiber 
bundles, correspondence between, 
229; gravity as, 422; lattice, 356— 
58; redundancy in, 169-70, 171, 
spontaneously broken, magnetic 
monopoles in, 285; spontaneously 
broken, Feynman rules for, 239- 
40; and superconductivity theory, 
271; symmetry breaking in, 236— 
38, 241, 271; unsatisfactory 
formulation of, 456; vortex in, 
283. See also nonabelian gauge 
theory(ies) 

gauge transformation (local transformation), 
227; and general coordinate 
transformation, 428 

Gaussian integration, 13, 459 

Gaussian theory (free-field theory), 21-22, 
41; in terms of Fourier transform, 24 
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Gell-Mann, Murray: on quark color, 368; 
and seesaw mechanism, 410; o model 
of, 318-19; SU (3) of, 469 

Gell-Mann matrices, 238 

general coordinate invariance, 34 

general coordinate transformation, and gauge 
transformation, connection between, 
428 

general covariance, principle of, 76 

general relativity: and quantum mechanics, 
marriage of, 6; review of, 79-82 

generator, charge as, 74 

Georgi, Howard, 405n 

Georgi and Glashow, grand unification 
theory of, 391-93 

ghost fields, 354-55 

Ginzburg, V., 237; on London penetration 
length, 271; on second-order transitions, 
267; on superconductivity, 270 

Girvin, Steve, 306 

Glashow, Sheldon, 155; electroweak theory 
of, 365; grand unification theory of, 
391-93; Yang-Mills theory and, 361 

gluon(s), 369; origins of concept, 207 

Goldberger, Murph, 101 

Goldberger-Treiman relation, 206, 207, 320 

Goldstone’s theorem, 198-99; in condensed 
matter physics, 199-200 

Golfand, Yu. A., 443 

Gordon decomposition, 178 

Goto, T., 452 

grand unification, 437; charge conjugation 
in, 413-14; and deeper understanding 
of physics, 394-95; fermion masses 
in, 401—2; and freedom from anomaly, 
395-96, 414; and hierarchy problem, 
403-4; need for, 391; and origin of 
matter, explanation for, 403; and proton 
decay, 397-98, 400, 441; SO (10): 
antineutrino field in, 409-10; binary 
code in, 411-12; spinor representation 
of, 405-7, 409, 410; SO (18), 413; SU 
(5), 469; SU (5), Georgi and Glashow 
theory of, 391—93; triumph of, 399-400 

Grassmann integration, 124-26 

Grassmann number(s), 121, 124; in path 
integral for spinor field, 122 

Grassmann variables, 218 

Grassmannian symmetry, 335 

gravitational force. See gravity 

gravitational interaction, 35 

graviton: coupling to matter, 421; definition 
of, 78; as elementary particle, 420, 
434; force associated with, 27; in 
(n + 3+ 1)-dimensional universe, 


graviton propagator, 423—24 

gravity: Einstein on. See Einstein’s theory 
of gravity; Einstein-Hilbert action 
for, 419-20; and electromagnetism, 
unification of, 428; as field theory, 
420-22; as gauge theory, 422; helicity 
structure of, 431-32; of light, 427; 
Newton on. See Newton’s gravitational 
force; nonrenormalizability of, 421, 
weak field action for, 422-23 

Green’s function(s), 45, 332; generating, 49; 
propagator related to, 22 

Gross, David, 369 

Gross-Neveu model, 386-88 

ground state, in quantum field theory, 196 

ground state degeneracy, 297 

group theory, review of, 461-70. See also 
special orthogonal group SO (N); 
special unitary group SU (N) 


hadron(s): electron-positron annihilation 
into, 372-75; in electroweak unification, 
365; experimental observation of, 202; 
quarks as components of, 368 

Hall effect, 331-32; fractional, 301-2: 
integer, 301 

Hall fluid(s), 300-308; Chern-Simons term 
for, 304; effective field theory of, 302— 
3, 437-38; electron tunneling in, 307; 
five general statements/principles of, 
302, 303, 307; gauge potential in, 303- 
4, 307; incompressibility of, 301, 306; 
Laughlin odd-denominator, 306; order 
in, 306 

handedness, field, 96; charge conjugation 
and, 98 

Hansson, T., 302 

harmonic paradigm, 5 

Hasslacher, Brosl, 389 

Hawking radiation, 264-65 

Heaviside, O., 217 

hedgehog, 284 

Heisenberg, Werner: approach to quantum 
mechanics, 61-62; isospin SU (2) of, 
469; isospin symmetry of, 370, 371; on 
neutron and proton, symmetry of, 71 

hierarchy problem, grand unification and, 
403-4 

Higgs field, covariant derivative of, 238 

Higgs particle, mass of, 366 

high energy physics: renormalization group 
in, 340-41 

homotopy groups, 283 

Hopf term, 296; non-local, 307 

Howe, P., 453 


39_40; self-interaction of, 420; spin of, 34. Hubbard-Stratonovich transformation, 174 
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identity, absolute, 117-18 

impurities, 301; condensed matter physics 
and study of, 334; and random potential, 
330 

infinities, in quantum field theory, 145-46 

instanton(s), 286; discovery of, 455 

integer Hall effect, 301 

integration measure, in path integral 
formalism, 67 

integration variables, shifting, 245 

interchange symmetry, 71 

internal symmetry, 7 1 

inverse square law, 38 

irrelevant operators, 345 

Ising model, 342 

isospin symmetry of Heisenberg, 370, 371 

Itzykson, Claude, 385 

Iwasaki, Y., 426 


Jona-Lasinio, G., 208 

Jordan, P., 103 

Josephson junction, fundamental relation 
underlying, 174 


Kadanoff, Leo, 342; and renormalization 
groups, 342 

Kaluza-Klein compactification, 427-28; 
derivation of, 432-33 

Kardar-Parisi-Zhang equation, 326 

kinks. See solitons 

Kivelson, Steve, 302 

Klein-Gordon equation, 21, 89, 91, 172; 
Schrédinger equation derived from, 172 

Klein-Gordon operator, 109 

Kosterlitz-Thouless transition, 287 

Kramer’s degeneracy, 100 


Lagrangian: Dirac. See Dirac equation; 
gauge invariant, 227-28; Maxwell. 
See Maxwell Lagrangian; Meissner, 
310, 313; as mnemonic, 318, 320; for 
quantum electrodynamics, 97, 135; 
symmetries of, breaking, 193; weak 
interaction, 96; Yang-Mills, 230 

Lamb shift in atomic spectroscopy, 190 

Landau, L. D., 237; on London penetration 
length, 271; on second-order transitions, 
267; on superconductivity, 270; on 
superfluidity, 258 

Landau gauge, 141 

Landau levels, 301 

Landau-Ginzburg approach to quantum field 
theory, 17 

Landau-Ginzburg theory (mean field theory), 
268-69; order in, 306 

Laplace, P.-S., 264 


large N expansion, 377-79; Dyson gas 
approach to, 384-86; field theories in, 
386-88 

Larmor circle, 300, 301 

lattice gauge theory, 356-58; Wilson loop in, 
358-60 

Laughlin odd-denominator Hall fluids, 306 

Lee, B., 158 

Lee, D. H., 314n 

Lee, Tsung-dao, 96 

Legendre transform, 209 

Leinaas, J., 293 

length scales: in condensed matter physics, 
154; renormalization group and, 
342-43 

leptons: families of, 366; generations of, 413; 
and quarks, neutral current interaction 
between, 366 

Lévy, M., o model of, 318-19 

Licciardello, D., 347 

light, gravity of, 427 

light beam, stress-energy tensor of, 430 

Likhtman, E. P., 443 

linearly dispersing mode, 258; velocity of, 
259 

local field theory, 456 

local transformation. See gauge 
transformation 

localization: Anderson, 330-31, 334; 
Anderson, in renormalization group 
language, 347-49; study of, 335 

logarithmic divergence, 160, 161-62 

London penetration length, 271 

loop diagrams, 44, 56-57, 56f, 58f 

Lorentz algebra, 112-13 

Lorentz boosts, 111-12 

Lorentz group: defining representation of, 
113; generators of, algebra for, 112-13; 
spinor representation of, 113-15 

Lorentz invariance, 457; canonical formalism 
and, 67; Euclidean equivalent of, 343; 
in quantum field theory, 17 

Lorentz transformation, and Dirac equation, 
92-93 

Liders, G., 118 


MacDonald, Alan, 306 

magnetic charge (monopole), 222, 285; 
confinement in superconductor, 369— 
70; Dirac quantization of, 220-21, 
224; duality of, 312-13; electrically 
charged (dyon), 285; mass of, 285; 
and Maxwell’s equations, 222; 
quantum mechanics and, 217; in 
spontaneously broken gauge theory, 
285 
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magnetic moment of electron: anomaly 
in, 179; calculation of, 439; in Dirac 
equation, 177-78; Schwinger on, 
179-81, 439 

magnetic moment of ferromagnet and 
antiferromagnet, 322 

magnetic moment of proton, anomaly in, 
439-40 

Majorana, Ettore, 98n, 481 

Majorana equation, 98-99 

Majorana mass, 98; for neutrino, 99 

Majorana spinor, 99, 481 

marginal operators, 345 

mass(es): attraction between, 33-34; of 
electron, 166; energy of, 34; of gauge 
boson, 238-39; of Higgs particle, 366; 
of magnetic charge (monopole), 285; 
Majorana, 98, 99; of neutrino, 99; of 
nucleon, 319; Planck, 39-40, 420; of 
soliton (kink), 280 

mass renormalization, 159 

massive gauge field, Nambu-Goldstone 
boson and, 237-38 

massive spin 1 field, vs. massless spin 1 field, 
168 

massive spin 1 particle: degrees of freedom 
of, 35, 36; degrees of polarization of, 
32; field theory of, 30-31; propagator 
for, 33; rest frame for, 168; in 
Yang-Mills theory, 361 

massive spin 2 particle: degrees of 
polarization of, 33; propagator for, 33, 
426 

matrix (matrices): gamma. See gamma 
matrices; Gell-Mann, 238: Pauli, 238; 
without inverse, 167-68 

matter: origin of, explanation for, 403; states 
of, 306 

mattress model of scalar field theory, 4f, 
4-5; disturbing, 20, 20f; path integral 
description of, 16-18 

Maxwell action, 167 

Maxwell equations, magnetic charges and, 
222 

Maxwell Lagrangian, 30, 32, 79; bypassing, 
32-33; derivation of, 36 

Maxwell term, 298, 307 

Maxwell theory of electromagnetism: 
development of, 456-57; Yang-Mills 
theory compared with, 230 

mean field theory (Landau-Ginzburg theory), 
268-69; order in, 306 

Meissner effect, 271, 369 

Meissner Lagrangian, 310, 313 

meson(s): birth of, quantum field theory on, 


54-55, 55f; 1. See pion(s); 0, 319, 320; 
soliton compared with, 279; vector, 
field theory of, 30-31. See also massive 
spin 1 particle 

meson-meson scattering amplitude, 338; 
as analytic function, 189; cutoff 
dependence of, 158; dimensional 
analysis on, 158; divergence of, 145— 
46; path integral formulation of, 
150-51; regularization and, 147-48; 
renormalization and, 149-50 

Mills, Richard, and nonabelian gauge theory, 
226, 228 

Minkowskian path integral, 261 

momentum: fundamental definition of, 78; 
orbital angular, Dirac equation on, 177- 
78; spin angular, Dirac equation on, 
178 

momentum density, in nonrelativistic theory, 
173 

momentum space, 24; fermion propagator in, 
109; Feynman diagrams in, 53 

monopole, See magnetic charge 

Montonen, J., 313 

muon, weak decay of, 362 

Myrheim, J., 293 


Nambu, Yoichiro, 272, 452 

Nambu-Goldstone boson(s), 198-99; gapless 
mode as, 258; in massive gauge field, 
237-38, m mesons (pions) as, 205, 370, 
371; in relativistic vs. nonrelativistic 
theories, 259 

naturalness, notion of, 404 

Néel state, for antiferromagnet, 324 

Ne’eman, Y., SU (3) of, 469 

neutral current interaction, 366 

neutrino(s): handedness of, 98; mass of, 99 

neutrino masses, effective field theory of, 
44] 

neutron(s): B decay of, 202-3; electric dipole 
moment for, 232; and proton, internal 
symmetry of, 71 

Neveu, André, 386, 389 

Newton’s gravitational force: and Coulomb's 
electric force, comparison of, 27; 
derived from Einstein-Hilbert action, 
424~-25; quantum field theory on, 30, 
32-34 

Noether current, 173, 205 

Noether’s theorem, 73-74, 96, 319; elaborate 
formulation of, 74 

nonabelian Berry’s phase, 235, 325 

nonabelian gauge potential, 227, 228; 
coupling to fermion field, 233 
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nonabelian gauge theory(ies), 226-33; 
chiral anomaly in, 250; differential 
forms in, 228, 229; Feynman rules 
in, 473-74; gauge invariance in, 
preserving, 187; ghost action in, 

355; redundancy of, Faddeev-Popov 
approach to, 169; renormalizability of, 
158, 395; strong interaction described 
by, 232, 361; ’t Hooft double-line 
formalism and, 231-32; unsatisfactory 
features of, 456. See also Yang-Mills 
theory 

nonanalyticity: emergence of, 267; symmetry 
breaking and, 268 

noncommutative field theory, 456 

nonrenormalizable theory(ies), 154, 164, 
438; counterterms in, 164, 212; 
Einstein’s theory of gravity as, 157; 
Fermi’s theory of the weak interaction 
as, 155, 164 

nonrenormalization of the anomaly, 251 

notation, dotted and undotted, 113-14, 
479-82, replacing, 457 

nucleon(s): attraction between, 26; electron 
scattering off of, deep inelastic, 368- 
69; mass of, 319; and pions, interaction 
between, 318-19; wave function of 
quarks in, 368 


Olive, D. 1., 313 

orbital angular momentum, Dirac equation 
on, 177-78 

order parameters, 270 

orthogonal groups, embedding unitary 
groups into, 407-8 


Parisi, Giorgio, 385 

parity, 94; Dirac equation and, 94; and Dirac 
spinor, 114; weak interaction and, 96, 
361-62 

particle(s): birth and death of, 4—5; birth of, 
quantum field theory on, 54—55; field 
associated with, 24-25; force associated 
with, 25-27; interchanging, 293-94, 
294f; propagation of, describing, 46— 
47, 49; scattering of. See scattering of 
particles; sources and sinks for, 20. See 
also specific particles 

particle physics: and condensed matter 
physics, 255, 437-38; energy scales 
in, 154; family problem in, 413; 
spontaneous symmetry breaking in, 
267, 272, 435 

partition function, in quantum statistical 
mechanics, 262-63 


path integral formalism: vs. canonical 
formalism, 42, 61, 67; chiral anomaly 
and, 252; and classical limit, 19; 
derivation of, 42; description of 
mattress model, 16—18; Dirac on, 10~ 
13; Feynman on, 7-10; Grassmann 
math and, 125—26; history of, 59; 
integration measure in, 67; replacing, 
457; for spinor field, 221; and vacuum 
energy, calculation of, 121-23 

Pauli, Wolfgang, on spin-statistics 
connection, 118 

Pauli exclusion principle, 117, 301; history 
of, 117n 

Pauli matrices, 238 

Pauli-Hopf identity, 323 

Pauli- Villars regularization, 151-52 

Peierls, Rudolf, 347 

Peierls instability, 275 

pentagon anomaly, 250, 250f 

perturbation theory, 48-49; bare, 160; 
Feynman diagrams in, 54, 55f; 
finite temperature, 263; physical 
(renormalized/dressed), 160-61, 161f 

perturbative quantum gravity, 427 

¢' theory, renormalizability of, 158, 160 

phonon(s), 5, 258 

photon(s): absence of rest frame for, 168; 
birth and death of, 4; Bose-Einstein 
statistics for, 117; fluctuation into 
electron and positron, 183-85, 184f; 
force associated with, 27; longitudinal 
mode of, 141; spin of, 34 

photon propagation: charge as measure of, 
188; quantum fluctuations and, 183-85, 
184f 

photon propagator, 140-41; Fourier 
transform of, 190; physical 
(renormalized), 184, 184f 

physical perturbation theory, 160-61, 161f 

pion(s) (7 meson): massless, 206, 319; as 
Nambu-Goldstone boson, 205, 370, 
371; and nucleons, interaction between, 
318-19; prediction regarding, 27; 
quarks as components of, 368; weak 
decay of, 202-4 

pion-nucleon coupling constant, 206 

Planck mass: for (n + 3 + 1)-dimensional 
universe, 39-40; modified, 420 

Podolsky, B., 427 

Poincaré lemma, 219 

point particle: action of, constructing, 79-82; 
stress energy of, calculating, 82; world 
line traced out by, length of, 79, 80f 

Poisson equation, 425 
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polarization, degrees of, 32 

Politzer, H. D., on Yang-Mills theory, 369 

Polyakov, Sasha, on magnetic monopoles, 
285 

Polyakov action, 453 

Pontryargin index, 286 

positron(s): Dirac’s conception of, 5; photon 
fluctuation into, 183-85, 184f 

potential energy, double-well, 194, 194f 

power counting theorem, 161-63 

preons, theories about, 251-52 

propagation of particles, describing, 46-47, 
49 

propagator, 22—23; in canonical formalism, 
67, for Dirac field, 126; fermion, 109; 
graviton, 423-24; for massive spin 1 
particle, 33; for massive spin 2 particle, 
33, 426; photon, 140-41 

proton(s): charge of, grand unification on, 
394; electron scattering off of, deep 
inelastic, 341; electron scattering off of, 
Schrédinger equation for, 3; magnetic 
moment of, anomaly in, 439-40; and 
neutron, internal symmetry of, 71; 
quarks as components of, 368; stability 
of, 397 

proton decay: branching ratios for, 400- 
401, effective theory of, 440-41; grand 
unification and, 397~98, 400, 441; slow 
rate of, 403 


quantum chromodynamics (QCD), 341, 369; 
analytic solution of, search for, 375; at 
high energies, 375; large N expansion 
of, 377-79; renormalization group flow 
of, 371-72 

quantum electrodynamics (QED), 30; 
coupling constant of, 148; coupling 
in, 339, electromagnetic gauge 
transformation in, 171; Feynman rules 
for, derivation of, 135-41; intellectual 
incompleteness of, 118; Lagrangian for, 
97, 135; renormalizability of, 158 

quantum field theory(ies): in (0 + 0)- 
dimensional spacetime, 380; in 
2-dimensional spacetime, 453; 
anharmonicity in, 41, 85; asymptotic 
behavior of, study of, 340-41; central 
identity of, 167, 460; and condensed 
matter physics, 5, 172, 255; crisis of, 
202, 318, 437; in curved spacetime, 
77, 264; divergences in, 56-57, 145- 
46; Euclidean, 261-62, 263, 264; at 
finite density, 266; at finite temperature, 
263-64; gravity as, 420-22; ground 


state in, 196; harmonic paradigm 
and, 5; hidden structures in, 457; 
history of, 59; infinities in, 145-46; 
innovative applications of, 455-56, 
458; integral of, 84-85; low energy 
manifestation of, 146, 154, 437; 
mattress model and, 16—18; motivation 
for constructing, 54; need for, 3—5, 6, 
122; nonrelativistic limit of, 172-73, 
relativistic vs. nonrelativistic, 173-75; 
renormalizable vs. nonrenormalizable, 
154; on repulsion and attraction, 30~ 
34; restrictions within, 455, 456; strong 
and weak interactions applied to, 202; 
of strong interaction, 318; steps toward, 
206-7; supersymmetric, 443, 450; 
surface growth and, 327-28; symmetry 
breaking in, 195—96; theories subsumed 
by, 455; threshold of ignorance in, 146— 
47, 438; triumph of, 437, 455; vacuum 
in, 19 

quantum fluctuations: axial current 
conservation destroyed by, 247-48; 
effective potential generated by, 215; 
and electric charge, 188, 189; first order 
in, 210-11; higher order, and chiral 
anomaly, 286; and photon propagation, 
183-85, 184f; and symmetry breaking, 
200, 208, 214, 243 

quantum Hall fluid. See Hall fluid(s) 

quantum Hall system, 255 

quantum mechanics: and general relativity, 
marriage of, 6; harmonic oscillator in, 
solving, 41; Heisenberg’s approach 
to, 61-62; and magnetic monopoles, 
217; partition function in, 262-63; path 
integral formalism of, 7-13; quantum 
field theory as generalization of, 84-85, 
455; and relativistic physics, joining 
in spin-statistics connection, 119; and 
special relativity, marriage of, 3, 6, 
118; symmetry breaking in, 195-96, 
symmetry of, 243; time reversal in, 
99-100; and vector potential, need for, 
217 

quantum statistics, 117 

quantum vacuum, 340 

quark(s): color of, 368, 369; confinement 
of, 360, 369-70; in electroweak 
unification, 365; families of, 366; 
flavors of, 368; generations of, 413; 
and leptons, neutral current interaction 
between, 366; origins of concept, 207; 
strong interaction between, weakening 
of, 341 
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quasiparticle(s), 304; charge of, 305-6; 
fractional statistics and, 305, 306; as 
vortex, 306 


radiation: and atoms, interaction between, 3; 
Hawking, 264-65 

Ramakrishnan, T. V., 347 

Ramond, Pierre, and seesaw mechanism, 410 

tandom dynamics, and quantum physics, 328 

random matrix theory, 379-80; Feynman 
rules in, 381, 381f 

random potential, impurities and, 330 

Rarita-Schwinger equations, 116 

redundancy, Faddeev-Popov approach to, 
169-70 

reflection symmetry, 70, 197; breaking, 193, 
194, 195 

regularization, 147-48; dimensional, 151, 
152-53, 187; early taste of, 65; gauge 
invariance respected by, 185-87; 
Pauli- Villars, 151-52 

relativistic physics: equations of motion 
in, unified view of, 91; language of, 
24; and quantum physics, joining 
in spin-statistics connection, 119. 
See also general relativity; special 
relativity 

relativistic quantum field theory: correctness 
of, establishment of, 179; vs. 
nonrelativistic quantum field theory, 
173-75 

relevant operators, 345 

renormalizable conditions, imposing, 212-13 

renormalizable theory(ies), 154, 158, 438; 
electroweak theory as, 366; nonabelian 
gauge theory as, 158, 395; y* theory as, 
158, 160; Yukawa theory as, 163-64 

renormalization, 145, 148-50; coupling, 
158-59; of electric charge, 189, 190; 
field, 160; mass, 159; wave function, 
160 

renormalization group, 337, 339; and 
Anderson localization, 347-49; in 
condensed matter physics, 34144; and 
effective description, 349; effective 
field theory philosophy and, 438; 
in high energy physics, 340—41; in 
quantum chromodynamics, 371-72 

renormalization theory, application of, 
211-12 

renormalized coupling constant, 150 

renormalized (dressed) perturbation theory, 
160-61, 161f 

reparametrization invariance, 79 

replica method, 333-34 


representations: conventions for naming, 
462; multiplying, 468-69 

repulsion: of bosons, 174-75, 257, 316; 
quantum field theory on, 30~31; spin 1 
particle and, 35; of vortices, 316 

Ricci tensor, 419 

Riemann curvature tensor, 419 

Riemann-Christoffel symbol, 81 

Riemannian manifolds, differential geometry 
of, 428-30 

Rosenbluth, Marshall, 101 

rotation group, 111; and Lorentz group, 
symmetry of, 115 

R; gauge, 240, 241 


Salam, Abdus, 155; electroweak theory 
of, 365; superspace and superfield 
formalism of, 444, 445 

scalar boson operator, 109 

scalar field: Feynman rules for, 53-54, 471- 
72; quantizing in curved spacetime, 77; 
and vacuum energy, 65 

scalar field theory: classical field equation 
in, 19; Euclidean functional integral 
and, 261; Euclidean version of, 268; 
massless version of, 258 

scalar potentials, 217 

scattering of particles: describing, 49-52, 
50f, 51f; fermion-fermion, Feynman 
diagram for, 156, 157f£; meson-meson. 
See meson-meson scattering amplitude; 
reflection symmetry in, 70; and vacuum 
fluctuations, 122. See also electron 
scattering 

Schrieffer, Bob, 272 

Schrédinger equation: electromagnetic gauge 
transformation in, 171; Klein-Gordon 
equation and, 21n, 172; limitations of, 
3; Yang-Mills structure in, 234-35 

Schwarz, John, on string theory, 454n 

Schwarzschild black hole, 287-88 

Schwarzschild solution, for Hawking 
radiation, 264 

Schwinger, Julian: and effective potential, 
208; on Feynman’s contribution, 41, 49; 
on magnetic moment of electron, 179- 
81, 439; on path integral formalism, 
59; at Pocono conference (1948), 101; 
teaching style of, 439; Yang-Mills 
theory and, 361 

second-order phase transitions, 267 

seesaw mechanism, 410 

Seiberg, Nathan, 313 

self-dual theory, 316 

semions, 293 
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o meson, 319, 320 

o model, 318-19; for ferromagnets and 
antiferromagnets, 323, 324; nonlinear, 
320, 323 

Slansky, Dick, and seesaw mechanism, 410 

S-matrix theory, 207, 318 

SO (N). See special orthogonal group 

solid state physics, Dirac equation in, 273, 
274 

solitons (kinks): discovery of, 277-79, 
455; dynamically generated, 388-89; 
mass of, 280; topological stability of, 
279; unifying language for discussing, 
283 

sources and sinks, creating, 20, 49, 5O0f 

spacetime: curved. See curved spacetime; 
dimension of, and symmetry breaking, 
200; discretizing, 21; Feynman 
diagrams in, 53, 57; symmetry of, 
Lorentz invariance as, 70 

special orthogonal group SO (N), 461- 
64; SO (3), 462-64; SO (10) grand 
unification: antineutrino field in, 409- 
10; binary code in, 411-12; spinor 
representation of, 405—7, 409, 410; SO 
(18), 413 

special relativity, and quantum mechanics, 
marriage of, 3, 6, 118 

special unitary group SU (N), 464-68; 
decomposing representations of, 470; 
SU (2), 467-68; SU (2), of Heisenberg, 
469; SU (3), 467, 468; SU (3), of Gell- 
Mann and Ne’eman, 469; SU (5), 469; 
SU (5), Georgi and Glashow theory of, 
391-93 

spin angular momentum, Dirac equation on, 
178 

spin wave, 199 

spinor(s): Dirac, 90, 92, 111, 114; Majorana, 
99, 481; representations of, 113-14; 
Weyl, 114, 444 

spinor field: path integral for, 121; deriving, 
12426; path integral for, Grassmann 
numbers in, 122; vacuum energy of, 
107-8, 124 

spin-statistics rule, 117-18; and 
anticommutation relations, 119, 
121; price of violating, 118-20 

spontaneous symmetry breaking, 193-94, 
195, 197; continuous: and massless 
fields, 198—99; of reflection symmetry, 
195; in gauge theories, 236-38; 
in particle physics, 267, 272, 435; 
quantum fluctuations and, 200; in 
relativistic vs. nonrelativistic theories, 


259; second-order phase transitions 
and, 267; and superfluidity, 257-58 

square anomaly, 250, 250f 

steepest-descent approximation, 15 

Stoke’s theorem, 283 

Stoner, E. C., 117n 

Strathdee, J., superspace and superfield 
formalism of, 444, 445 

stress-energy tensor, 34; definition of, 78; of 
light beam, 430; properties of, 79 

string theory: as 2-dimensional field theory, 
452-53; as candidate for unified theory, 
419, 437; and cosmological constant 
problem, inability to resolve, 436, 
duality of, 313; Kaluza-Klein idea and, 
428; origins of, 6, 370; p-forms in, 223; 
in quantum field theory, 455; Schwarz 
on, 454n 

strong coupling: fixed point in, 340; linking 
to perturbative weak coupling, 455 

strong interaction: chiral symmetry of, 
205; currently accepted theory of, 
360; fundamental theory of, 341; 
hadronic, 35; at low energies, 318-19; 
nonabelian gauge theory on, 232, 361; 
quantum field theory of, 206-7, 318; 
renormalization group flow applied to, 
349; symmetries of, 205, 370-71 

SU (N). See special unitary group 

supercharges, 446 

superconductivity, 270-72; and Meissner 
effect, 271 

superconductor(s): monopole confinement 
in, 369-70; type II, flux tube in, 283 

superfield, 446-47; chiral, 447, 448; vector, 
448-49 

superfluidity, 174; gapless excitations and, 
258-59; Lagrangian summarizing, 258; 
linearly dispersing mode of, 258, 259, 
spontaneous symmetry breaking and, 
257-58 

superspace and superfield formalism, 444-45 

superstring theory, 454 

supersymmetric action, 448, 449 

supersymmetric algebra, 444-45 

supersymmetric field theories, 443; 
Yang-Mills, 375, 450 

supersymmetric method, 335 

supersymmetric transformation, total 
divergence under, 447-48 

supersymmetry, 108; Dirac spinor and, 111; 
inventing, 444; motivations for, 443 

surface growth, 326, 341; and quantum field 
theory, 327-28 

Swieca, Jorge, 201n 
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symmetry, 70-75; in amplitudes, 72; chiral, 
205, 370, 371, 404; classical vs. 
quantum, 243-44; continuous, 71— 
72, 197; breaking, 196; conserved 
current and, 73-74; in field theories, 
457; Grassmannian, 335; Heisenberg 
isospin, 370, 371; interchange, 71; 
internal, 71; power of, 18, 70, 115; 
reflection, 70, 197; replica, 334; of 
spacetime, Lorentz invariance as, 70; 
strong interaction, 205, 370-71, tensors 
and, 462, 465. See also supersymmetry 

symmetry breaking, 193-201; continuous 
symmetry and, 196; dimension of 
spacetime and, 200; dynamical, 201, 
371; in gauge theories, 236-38, 241, 
271; and nonanalyticity, 268; quantum 
fluctuations and, 200, 208, 214, 243; in 
quantum mechanics vs. quantum field 
theory, 195-96; reflection symmetry 
and, 193, 194, 195; and superfluidity, 
257-58; and vacuum energy, 435. 
See also spontaneous symmetry 
breaking 


*t Hooft, Gerardus, 158; on electroweak 
theory, 366; and large N expansion, 
377; on magnetic monopoles, 285 

*t Hooft double-line formalism, 231-32 

Teller, Edward, 101 

temperature: black hole, 264, 265; and cyclic 
imaginary time, 263; fimte, quantum 
field theory at, 263-64 

tensor(s): energy-momentum, 297; of 
orthogonal group, 461-62; Ricci, 
419: Riemann curvature, 419; stress- 
energy, 34, 78-79; of light beam, 430; 
symmetry properties of, 462, 465; 
of unitary group, 464-65; vacuum 
polarization, 183, 184f, 187 

tensor field, 34, 78 

@ term, 232 

Thomas precession, 112 

3-brane, 38-40 

time ordering, in canonical formalism, 67-68 

time reversal, 99-100; and Dirac equation, 
100-101 

Tolman, R., 427 

Tolman-Ehmefest-Podolsky effect, 427, 431 

Tomonaga, Shin-Itiro, 59 

topological current, 279 

topological field theory, 296-97 

topological objects, 282; discovery of, shock 
of, 287-88. See also specific objects 

topological order, 306 


topological quantum fluids, 300. See also 
Hall fluid 

total divergence, under supersymmetric 
transformation, 447-48 

trace, 462 

tree diagrams, 44 

Treiman, Sam, 207. See also Goldberger- 
Treiman relation 

triangle anomaly, 244f, 250 


ultraviolet catastrophe, 434 

ultraviolet divergence, 146 

uncertainty principle, 3, 264 

unification. See grand unification 

unitary gauge, 241 

unitary groups, embedding into orthogonal 
groups, 407-8 

universe: early, 264; formation of structure 
in, 35; 3-brane, 38-40 


vacuum: disturbing of, 20, 20f; disturbing of, 
Casimir effect of, 65-67; quantum, 19, 
340 

vacuum energy: calculation of, using path 
integral formalism, 121-23; of free 
scalar field, 64—65; of free spinor field 
(Dirac field), 107-8, 124; Grassmann 
path integral for, 126; symmetry 
breaking and, 435 

vacuum expectation value, 196 

vacuum fluctuations, 57-59, 59f; Feynman 
diagram corresponding to, 128; 
scattering of particles and, 122 

vacuum polarization tensor, 183, 184f, 187 

van Dam, H., 426 

van der Waerden notation. See dotted and 
undotted notation 

vector field, interacting with Dirac field, 
97; Feynman rules for, 128, 128f, 
472-73 

vector meson (massive spin 1 meson): field 
theory of, 30-31. See also massive spin 
1 particle 

vector potential, 217 

vector superfield, 448-49 

Veltman, Torny, 158, 426; on electroweak 
theory, 366 

vielbeins, 429 

visual perception, application of field theon 
to, 458 

vortex (vortices), 282, 309-10; as charges 
in dual theory, 310-12; density of. 
311; duality of, 312; as flux tube. 
283: motion in fluid, 316-17, 317f: 
paired with antivortex, 286-87, 317. 
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vortex (cont.) 
quasiparticle as, 306; repulsion of, 316 


Ward-Takahashi identity, 140, 395 

wave function(s), Anderson localization of, 
330-31, 334 

wave function renormalization, 160 

wave packets, in mattress model, 4f, 
4-5 

weak interaction, 35; intermediate vector 
boson of, 156-57, 285; and parity, 96, 
361-62; quantum field theory applied 
to, 202. See also Fermi theory of the 
weak interaction 

weak interaction Lagrangian, 96 

Weinberg, Steve, 155; electroweak theory of, 
365 

Weisskopf phenomenon, 165-66; grand 
unification and, 403 

Wen, Xiao-gang, 301, 307; and topological 
order, 306 

Wentzel, G., 101 

Wess-Zumino model, 444 

Wey] basis, 95-96, 115 

Weyl spinors, 114; and supersymmetry, 
444 

Wheeler, John, 347n 

Wick, Gian Carlo, 13 

Wick contractions, 13-14, 15, 45 

Wick rotation, 12, 261 

Wick theorem, 13 

Wigner, Eugene: on antisymmetric wave 
function of electron, 103; and law of 
baryon number conservation, 397; and 
random matrix theory, 379; on time 
reversal, 99 

Wigner semi-circle law, 380-84 

Wilczek, Frank, 293, 294; on Yang-Mills 
theory, 369 


Wilson, Ken, 145; and complete theory 
of critical phenomena, 269; and 
effective field theory approach, 437; 
and lattice gauge theory, 356-58; and 
renormalization groups, 342 

Wilson loop, 235; in lattice gauge theory, 
358-60; and quark confinement, 369 

Witten, Ed, 313 

Wu, Yang-shi, 220 


Yanagida, T., and seesaw mechanism, 410 

Yang, Chen-Ning, 96, 101, 220; and 
nonabelian gauge theory, 226, 228 

Yang-Mills bosons, 230, 369; self-interaction 
of, 420 

Yang-Mills coupling constant, 231-32 

Yang-Mills Lagrangian, 230 

Yang-Mills theory, 230-31; area law in, 360; 
asymptotically free, 369; Einstein’s 
theory of gravity compared with, 
430; Einstein-Hilbert action compared 
with, 420-21; Feynman rules in, 
230, 231f, original response to, 353, 
361; perturbative approach to, 356; 
quantizing, 353-55; Schrédinger 
equation and, 234-35; supersymmetric, 
375, 450; Wilson formulation of, 
356-58 

Young tableaux, 462 

Yukawa, H., 26-27, 156 

Yukawa coupling, 155 

Yukawa theory, renormalizability of, 163-64 


Zakharov, V., 426 

Zee, A., 294 

Zhang, Shou-cheng, 302 
Zinn-Justin, Jean, 158 
Zuber, Jean-Bernard, 385 
Zumino, Bruno, 118, 453 


